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5 IWUNOREACTIVE HEPATITIS C VIRUS POLYPEPTIDE COMPOSITIONS 

Tpghnical Field 

This invention relates generally to 
imntunoreactive polypeptide compositions, methods of using 
10 the compositions in immunological applications, and 
materials and methods for making the compositions. 

Background . 

The hepatitis C virus has been recently 

15 identified as the major causative agent of post- 
transfusion Non-A f Non-B hepatitis (NANHB) , as well as a 
significant cause of community- acquired NANBH* 
Materials and methods for obtaining the viral genomic 
sequences are known. See, e.g. PCT Publication Nos. 

20 WO89/04669, WO90/11089 WO90/14436. 

Molecular characterization of the HCV genome 
indicates that it is a RNA molecule of positive polarity 
containing approximately 10,000 nucleotides that encodes 
a polyprotein of about 3011 amino acids. Several lines 

25 of evidence suggest that HCV has a similar genetic 

organization to the viruses of the family Flaviviridae, 
which includes the fflavi- and pestivirus. Like its 
pesti- and flaviviral relatives, HCV appears to encode a 
large polyprotein precursor from which individual viral 

30 proteins (both structural and non- structural) are 
processed. 

RNA- containing viruses cam have relatively 
high rates of spontaneous mutation, i.e., reportedly on 
the order of 10* 3 to 10" 4 per incorporated nucleotide. 
35 Therefore, since heterogeneity and fluidity of genotype 
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are common in RNA viruses, there may.be multiple viral 
isolates, which may be virulent or avirulent, within the 
HCV species. 

A number of different isolates of HCV have now 
5 been identified. The sequences of these isolates 

demonstrate the limited heterogeneity characteristic of 
RNA viruses. 

Isolate HCV Jl.l is described in Kubo, Y. et 
al. (1989), Japan. Nucl. Acids Res. 12:10367-10372; 
10 Takeuchi, K. et al.(1990), Gene 21:287-291; Takeuchi et 
al. (1990), J. Gen. Virol. 21:3027-3033; Takeuchi et al . 

(1990) , Nucl. Acids Res. lfi:4626. 

The complete coding sequences plus the 5'- and 
3' -terminal sequences of two independent isolates, 
15 "HCV- J" and "BK", are described by Kato et al. and 

Takamizawa et al, respectively. (Kato et al. (1990), 
Proc. Natl. Acad. Sci. USA £2:9524-9528; Takamizawa et al 

(1991) , J. Virol. ££:1105-1113.) 

Other publications describing HCV isolates are 
20 the following; 

"HCV-1": Choo et al (1990), Brit. Med. 
Bull. ±£:423-441; Choo et al . (1991), Proc. 
Natl. Acad. Sci. USA 44:2451-2455; Han et al. 
(1991), Proc. Natl. Acad. Sci. USA 44:1711- 
25 1715; European Patent Publication No. 318,216. 

•HC-Jl- and -HC-J4": Okamoto et al. 
(1991), Japan J. Exp. Med. £fl:167-177. 

■HCT 18-, "HCT 23", "Th", "HCT 27", "EC1" 
and -BCIO": Weiner et al. (1991), Virol. 
30 114:842-848. 

"Pt-l", -HCV-K1- and "HCV-K2" : Bnomoto et 
al, There are two major types of hepatitis C 
virus in Japan. Division of Gastroenterology, 
Department of Internal Medicine, Kanazawa 
35 Medical University, Japan. 
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Clones "A", m C m , "D" & *E m : Tsukiyama- 
Kohara et ah, As cond group of hepatitis 
virus, in Vims Gengs. 

A typical approach to diagnostic and vaccine 
strategy is to focus on conserved viral domains. This 
approach, however, suffers from the disadvantage of 
ignoring important epitopes that may lie in variable 
domains. 

It is an object of this invention to provide 
polypeptide compositions that are immunologically cross - 
reactive with multiple HCV isolates, particularly with 
respect to heterogeneous domains of the virus. 

Summary of the Invention 

According to an aspect of the present invention 
there exists an immunogenic polypeptide composition which 
comprises at least two HCV amino acid sequences, each HCV 
sequence comprising at least one epitope within a 
variable domain of an HCV envelope protein, wherein the 
variable domain regions of the amino acid sequences are 
heterogeneous with each other and are derived from 
distinct HCV isolates. 

According to another aspect of the present 
invention there exists a method of detecting antibodies 
to HCV within a biological sample which comprises (a) 
providing a biological sample suspected of containing 
antibodies to multiple strains of HCV; (b) providing a 
polypeptide reagent; (c) reacting the biological sample 
of (a) with the polypeptide reagent of (b) under 
conditions which allow the formation of antigen-antibody 
complexes; and (d) detecting the formation of complexes 
formed between the antigen of (a) and the antibodies of 
the biological sample of (b) , if any; characterized in 
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that as a polypeptide reagent it contains a 
immunoreactive composition which comprises at least two 
HCV amino acid sequences, each HCV sequence comprising at 
least one epitope within a variable domain of an HCV 
envelope protein, wherein the variable domain regions of 
the amino acid sequences are heterogeneous with each 
other and are derived from distinct HCV isolates. 

According to yet another aspect of the present 
invention there exists a DNA molecule encoding a 
polypeptide which comprises two heterogeneous amino acid 
sequences from the same variable domain of distinct HCV 
isolates. 



5 
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Brief Description of tha Pimir^ 

Figure 1 schematically shows the genetic 
organization of the HCV genome. 

Figure 2 shows a comparison of the deduced 
amino acid sequences of the Bl protein encoded by group I 
and group II HCV isolates. 

Figure 3 shows a comparison of the amino acid 
sequences of the putative E2/NS1 region of HCV isolates. 

Figure 4 are graphs showing the antigenicity 
profiles for the amino- terminal region of the putative 
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HCV E2/NS1 protein (amino acids 384-420), and the gp 120 
V3 hypervariable region of HIV-l. 

Figure 5 shows a series of graphs which give 
the percentage probabilities that a given residue from 
5 the amino- terminal region of HCV E2/NS1 protein (amino 
acids 384 to 420) will be found in either alpha-helix, 
beta -sheet or beta -turn secondary structural motif . 

Figure 6 are bar graphs showing the reactivity 
of antibodies in the plasma from HCV 18 (panels A-C) or 
10 Th (Panels D-f) with overlapping biotinylated 8mer 

peptides derived from amino acids 384 to 415 or 416 of 
HCV isolates HCT 18 (A, D) , Th (B , E) and HCV Jl (C,F), 
respectively. 

Figure 7 shows the deduced amino acid sequences 
15 of two regions of the E2/NS1 polypeptide, amino acids 
3 84-414 and 547-647, given for the Ql and Q3 isolates. 

Figure 8 A shows the deduced amino acid 
sequences of isolates HCV Jl.l and jr. 2 from amino acids 
384 to 647. Figure 8B shows the deduced amino acid 
20 sequences of isolates HCT27 and HCVE1 from amino acids 
384 to 651. 

Figure 9 shows the entire polyprotein sequence 
of isolate HCV-1. 

25 Maflsa at Practicing the Invention 

The practice of the present invention will 
employ, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology, 
recombinant DNA, and inrounology, which are within the 

30 skill of the art. Such techniques are explained fully in 
the literature. See e.g., Maniatis, Fitsch & Sambrook, 
MOLECULAR CLONING; A LABORATORY MANUAL (2nd ed. 1989) ; 
DNA CLONING, VOLUMES I AND II (D.N Glover ed. 1985); 
OLIGONUCLBOTIDB SYNTHESIS (M.J. Gait ed, 1984); NUCLEIC 

35 ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins eds. 1984); 
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TRANSCRIPTION AND TRANSLATION (B.D. Hames & S.J, Higgins 
eds. 1984); ANIMAL CELL CULTURE (R.I. Freshney ed. 1986); 
IMMOBILIZED CELLS AND ENZYMES (IRL Press, 1986); B. 
Perbal, A PRACTICAL GUIDE TO MOLECULAR CLONING (19 84) ; 
5 the series, METHODS IN ENZYMOLOGY (Academic Press, Inc.); 
GENE TRANSFER VECTORS FOR MAMMALIAN CELLS (J.H. Miller 
and M.P.' Calos eds. 1987, Cold Spring Harbor Laboratory) , 
Methods in Enzymology Vol. 154 and Vol. 155 (Wu and 
Grossman, and Wu, eds., respectively), Mayer and Walker, 

10 eds. (1987) , IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR 
BIOLOGY (Academic Press, London), Scopes, (1987), PROTEIN 
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition 
(Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL IM- 
MUNOLOGY, VOLUMES X-XV (D.M. Weir and C. C. Blaclcwell eds 

15 1986); IMMUNOASSAY: A PRACTICAL GUIDE (D.W. Chan ed. 

1987) . All patents, patent applications, and publica- 
tions mentioned herein, both above and below r are 
incorporated by reference herein. 

HCV is a new member of the Family Flaviviridae 

20 which includes the pestiviruses (Hog Cholera Virus and 
Bovine Viral Diarrhea Virus) and the Flaviviruses, 
examples of which are Dengue and Yellow Fever Virus. A 
scheme of the genetic organization of HCV is shown in 
Figure 1. Similar to the flavi- and pestiviruses, HCV 

25 appears to encode a basic polypeptide domain <"C") at the 
N- terminus of the viral polyprotein followed by two 
glycoprotein domains {-El-, "B2/NS1"), upstream of the 
nonstructural genes NS2 through NS5. The amino acid 
coordinates of the putative protein domains are shown in 

30 Table 1. 
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Table 1. The Putative Protein Domains in HCV 

a. ft. coordinates [aBBggginataJ protein 

1 - 191 c 

192 - 383 E1 

384 - 750 E2/NS1 

751 - 1006 NS2 

1007 - 1488 NS3 

1489 - 1959 NS4 

1960 - 3011 NS5 



As discussed above, a number of HCV isolates 
have been identified. Comparative sequence analysis of 
complete and partial HCV sequences indicates that based 
upon homology at the nucleotide and amino acid levels, 
15 HCV isolates can be broadly sub -divided into at least 
three basic groups (Table 2). See Houghton et al., 
(1991) Hepatology 11:381-388 . However, only partial 
sequence is available for the isolates in group III. 
Therefore, when the sequences of these isolates are more 
defined, one or more of these isolates may deserve 
separation into a different group, including a potential 
fourth group. Table 3 shows the sequence homologies 
between individual viral proteins of different HCV 
isolates as deduced from their nucleotide sequences. It 
can be seen that the proteins of the same virus group 
exhibit greater sequence similarity than the same 
proteins encoded by different virus groups (Table 3 ) . 
One exception to this is the nucleocapsid protein that is 
highly conserved among all group I and II viral isolates 
30 sequences to date. (In Table 3. the symbol N/A signifies 
that the sequences were not available for comparison.) 
For purposes of the present invention, therefore, group I 
isolates can be defined as those isolates having their 
viral proteins, particularly El and E2/NS1 proteins, 
35 about 90% homologous or more at the amino acid level to 
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the isolates classified as group I herein. Group II is 
defined in an analogous manner. Future groups can 
likewise be defined in terras of viral protein homology to 
a prototype isolate. Subgroups can also be defined by 
homology in limited proteins, such as the El, E2/NS1 or 
NS2 proteins, or by simply higher levels of homology. 



Table 2 



n a «if<r»tinn of henatitis 


c viral 


genome RNA 


sAmiences into three basic QrOUPS 


HCV I 


HCY II 


HCY III 


HCV-1 


HCV-J1.1 


Clones A.CD&E 


HC-J1 


HC-J4 


HCV-K2 (afcb) 


HCT 18 


HCV-J 




HCT 23 


BK 




Th 


HCV-K1 




HCT 27 






EC1 






Pt-1 







20 

Table 3. Amino Acid Homologies (%) Between Viral 

Proteins Encoded by Different HCV isolates 
HCV C £1 E2/WS1 HSi HSi HS1 BSS 

Group 

25 T r-oTTTna-red to 

I 98-100 94-100 N/A N/A N/A N/A 99-100 

II 97-98 77-79 78-81 75-77 91-92 90-93 84-88 

III N/A N/A N/A N/A 86 76-80 71-74 

30 TT eomnared to 

II 98-100 92-100 89-100 93-100 94-100 97-100 95-100 
III N/A N/A N/A N/A 84 76 74-75 



35 



TTT coipp*T-ed to 

III N/A N/A N/A N/A N/A 91-100 89-100 
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It is noteworthy that the putative viral 
envelope proteins encoded by the El and E2/NS1 genes show 
substantial amino acid sequence variation between groups 
5 I and II. Only NS2 exhibits a greater degree of 

heterogeneity, while the C, NS3 , NS4 and NS5 proteins all 
show greater sequence conservation between groups . The 
sequence variation observed in the .putative virion 
envelope proteins between groups I and II reflects a 

10 characteristic segregation of amino acids between the two 
groups. An example of this is shown in Figure 2 where 
the sequence of the El gene product is compared between 
viruses of groups. I and II. The El amino acid sequences 
deduced from nucleotide sequences of HCV groups I and II 

15 are shown. In the figure, the horizontal bars indicate 
sequence identity with HCV-1. The asterisks indicate 
group- specif ic segregation of amino acids; the group- 
specific residues can be clearly identified. Group I 
sequences are HCV-1, HCT18 r HCT23 f HCT27, and HC-J1. 

20 Group II sequences are HC-J4, HCV- J, HCV Jl.l, and BK. 
Such group-specific segregation of amino acids is also 
present in other gene products including gp72 encoded by 
the E2/NS1 gene. Figure 3 shows the comparative amino 
acid sequence of the putative E2/NS1 region of HCV 

25 isolates which segregate as group I and group II. The 

latter protein also contains an N- terminal hypervariable 
region ("HV") of about 30 amino acids that shows large 
variation between nearly all isolates. See Weiner et al . 
(1991) , supra. This region occurs between amino acids 

30 384 to 414, using the amino acid numbering system of 
HCV-1. 

The putative HCV envelope glycoprotein E2/NS1 
may correspond to the gp53 (BVDV) /gp55 (Hog Cholera Virus) 
envelope polypeptide of the pestiviruses and the NS1 of 
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che f laviviruses , both of which confer protective 
immunity in hosts vaccinated with these polypeptides. 

Striking similarities between the 
hypervariable region ("HV*) and HIV-1 gpl20 V3 domains 
5 with respect to degree of sequence variation, the 
predictive effect of amino acid changes on putative 
antibody binding in addition to the lack of defined 
secondary structure suggest that the HV domain encodes 
neutralizing antibodies. 

!0 The immunogenicity of the domain is shown by 

antibody epitope mapping experiments, described in the 
Examples. The results of these studies suggest that in 
addition to the three major groups of HCV, HV specific 
sub-groups also exist. 

15 Analysis of biological samples from individuals 

with HCV induced NANBH indicate that individuals may be 
carrying two or more HCV variants simultaneously. Two 
co-existing HV variants were found in the plasma of one 
individual, Jl. In addition, partial sequencing of the 

20 gene of an individual With chronic NANBH, who had 

intermittent flares of hepatitis, revealed that the 
individual, Q, was infected with two HCV variants (Ql or 
Q3) . Each variant was associated with only one episode 
of the disease. An ELISA using a Ql or Q3 specific 

25 peptide (amino acids 396-407) showed that Q developed an 
antibody response to the Ql peptide but not the 
corresponding Q3 peptide, suggesting that Q's 
recrudescence of disease was due to the appearance of an 
HV variant. The presence of antibodies to the Ql peptide 

30 but lack of humoral immune response to the Q3 peptide 
during the second episode of disease suggest that 
variation in the HV domain may result from the pressure 
of immune selection. Amino acids 396-407 appear to be 
subject to the greatest selective pressure in the HV 
35 domain. These findings support the thesis that high 
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levels of chronicity associated with .the disease might be 
du to an inadequate immunological host response to HCV 
infection and/or effective viral mechanisms of 
immunological evasion. Moreover r they point to the 
5 E2/NS1 HV region as a genetic region involved in a viral 
escape mechanism and/or an inadequate immunological 
response* mechanism (s) . 

As discussed above, there are several variant 
regions within the HCV genome. One or more of these 

10 regions are most likely involved in a viral escape 

mechanism and/or am inadequate immunological response 
mechanism. Therefore, it is desirable to include in 
compositions for treatment of HCV polypeptides which 
would induce an immunogenic response to these variants. 

15 In that the El and E2/NS1 regions of the genome 

encode putative envelope type polypeptides, these regions 
would be of particular interest with respect to 
immunogenicity . Thus, these regions are amongst those to 
which it would be particularly desirable to induce and/or 

20 increase an immune response to protect am individual 

against HCV infection, and to aid in the prevention of 
chronic recurrence of the disease in infected 
individuals. In addition, these regions would be amongst 
those from which it would be desirable to detect HCV 

25 variants which are arising during the course of 

infection, as well as super- or co-infection by two or 
more variants. 

The present invention describes compositions 
and methods for treating individuals to prevent HCV 

30 infections, and particularly chronic HCV infections. In 
addition, it describes compositions and methods for 
detecting the presence of ant i- HCV antibodies in 
biological samples. This latter method is particularly 
useful in identifying anci-HCV antibodies generated in 

35 response to immunologically distinct HCV epitopes. This 
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method can also be used to study the evolution of 
multiple variants of HCV within an infected individual . 
In the discussion of the invention, the following 
definitions are applicable. 
5 The term "polypeptide" refers to a polymer of 

amino acids and does not refer to a specific length of 
the product; thus, peptides, oligopeptides, and proteins 
are included within the definition of polypeptide. This 
term also does not refer to or exclude post -express ion 

10 modifications of the polypeptide, for example, 

glycosylations , acetylations , phosphorylations and the 
like. Included within the definition are, for example, 
polypeptides containing one or more analogues of an amino 
acid (including, for example, unnatural amino acids, 

15 etc.), polypeptides with substituted linkages, as well as 
other modifications known in the art, both naturally 
occurring and non-naturally occurring. 

As used herein, A is "substantially isolated" 
from B when the weight of A is at least about 70%, more 

20 preferably at least about 80%, and most preferably at 

least about 90% of the combined weights of A and B. The 
polypeptide compositions of the present invention are 
preferably substantially free of human or other primate 
tissue (including blood, serum, cell lysate, cell 

25 organelles, cellular proteins, etc.) and cell culture 
medium. 

A "recombinant polynucleotide" intends a 
polynucleotide of genomic, cDNA, semisynthetic, or 
synthetic origin which, by virtue of its origin or 
30 manipulation: (l) is not associated with all or a portion 
of a polynucleotide with which it is associated in 
nature, (2) is linked to a polynucleotide other than that 
to which it is linked in nature, or (3) does not occur in 
nature. 
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A "polynucleotide" is a polymeric form of 
nucleotides of any length, either ribonucleotides or 
deoxyribonucl otides. This term refers only to the 
primary structure of the molecule . Thus, this term 
5 includes double- and single- stranded DNA and RNA. It 

also includes known types of modifications, for example , 
labels which are known in the art, methylation, "caps", 
substitution of one or more of the naturally occurring 
nucleotides with an analog, internucleotide modifications 

10 such as, for example, those with uncharged linkages 

(e.g., phosphorothioates , phosphorodithioates , etc.) ( 
those containing pendant moieties, such as, for example 
proteins (including for e.g., nucleases, toxins, 
antibodies, signal peptides, poly- L- lysine, etc.), those 

15 with intercalators (e.g., acridine, psoralen, etc.), 
those containing chelators (e.g. , metals, radioactive 
metals, etc. ) , those containing alkylators, those with 
modified linkages (e.g., alpha anomeric nucleic acids, 
etc.), as well as unmodified forms of the polynucleotide. 

20 "Recombinant host cells", "host cells", 

"cells", "cell lines", "cell cultures", and other such 
terms denoting microorganisms or higher eukaryotic cell 
lines cultured as unicellular entities refer to cells 
which can be or have been, used as recipients for a 

25 recombinant vector or other transfer polynucleotide, and 
include the progeny of the original cell which has been 
transfected. It is understood that the progeny of a 
single parental cell may not necessarily be completely 
identical in morphology or in genomic or total DNA. 

3 0 complement as the original parent, due to natural, 
accidental, or deliberate mutation. 

A "replicon" is any genetic element, e.g., a 
plasmid, a chromosome, a virus, a cosmid, etc., that 
behaves as an autonomous unit of polynucleotide 
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replication within a cell; i.e., capable of replication 
under its own control. 

A "vector" is a replicon further comprising 
sequences providing replication and/or expression of the 
5 open reading frame. ^ 

"Control sequence" refers to polynucleotide 
sequences which are necessary to effect the expression of 
coding sequences to which they are ligated. The nature 
of such control sequences differs depending upon the host 

10 organism; in proJcaryotes, such control sequences 

generally include promoter, ribosomal binding site, and 
terminators; in eukaryotes, generally, such control 
sequences include promoters, terminators and, in some 
instances, enhancers. The term "control sequences" is 

15 intended to include, at a minimum, all components whose 
presence is necessary for expression, and may also 
include additional components whose presence is 
advantageous, for example, leader sequences which govern 
secretion. 

20 A "promoter" is a nucleotide sequence which is 

comprised of consensus sequences which allow the binding 
of RNA polymerase to the DNA template in a manner such 
that mRNA production initiates at the normal 
transcription initiation site for the adjacent structural 

25 gene. 

"Operably linked" refers to a juxtaposition 
wherein the components so described are in a relationship 
permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence 
30 is ligated in such a way that expression of the coding 

sequence is achieved under conditions compatible with the 
control sequences. 

An "open reading frame" (ORF) is a region of a 
polynucleotide sequence which encodes a polypeptide; this 
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region may represent a portion of a coding sequence or a 
total coding sequence. 

A "coding sequence" is a polynucleotide 
sequence which is transcribed into mRNA and/or translated 
5 into a polypeptide when placed under the control of 

appropriate regulatory sequences. The boundaries of the 
coding sequence are determined by a translation start 
codon at the 5' -terminus and a translation stop codon at 
the 3' -terminus. A coding sequence can include but is 

10 not limited to mRNA, DNA (including cDNA) , and 
recombinant polynucleotide sequences. 

As used herein, "epitope" or "antigenic 
determinant", means an amino acid sequence that is 
immunoreactive . Generally an epitope consists of at 

15 least 3 to 5 amino acids, and more usually, consists of 
at least about 8, or even about 10 amino acids. As used 
herein, an epitope of a designated polypeptide denotes 
epitopes with the same amino acid sequence as the epitope 
in the designated polypeptide, and immunologic 

20 equivalents thereof. 

An "antigen" is a polypeptide containing one or 
more epitopes. 

"Immunogenic" means the ability to elicit a 
cellular and/or humoral immune response. An immunogenic 

25 response may be elicited by immunoreactive polypeptides 
alone, or may require the presence of a carrier in the 
presence or absence of an adjuvant. 

•Immunoreactive" refers to (1) the ability to 
bind imnunologically to an antibody and/or to a 

30 lymphocyte antigen receptor or (2) the ability to be 
immunogenic. 

An "antibody" is amy immunoglobulin, including 
antibodies and fragments thereof, that binds a specific 
epitope. The term encompasses, inter alia, polyclonal, 

35 monoclonal, and chimeric antibodies. Examples of 
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chimeric antibodies are discussed in U.S. Patent Nos. 
4,816,397 and 4,816,567. 

An "antigen set" is defin d as a composition 
consisting of a plurality of substantially identical 
5 polypeptides, wherein the polypeptides are comprised of 
an amino acid sequence of one defined epitope. 

"Substantially identical polypeptides" means 
polypeptides that are identical with the exception of 
variation limited to the typical range of sequence or 

10 size variation attributable to the polypeptide's method 
of production; e.g., recombinant expression, chemical 
synthesis, tissue culture, etc. This variation does not 
alter the desired functional property of a composition of 
substantially identical polypeptides; e.g., the 

15 composition behaves immunologically as a composition of 
identical polypeptides. The variations may be due to, 
for example, alterations resulting from the secretory 
process during transport of the polypeptide, less than 
100% efficiency in chemical synthesis, etc. 

20 As used herein, a "variable domain" or "VD" of 

a viral protein is a domain that demonstrates a 
consistent pattern of amino acid variation between at 
least two HCV isolates or subpopulations . Preferably, 
the domain contains at least one epitope. Variable 

25 domains can vary from isolate to isolate by as little as 
1 amino acid change. These isolates can be from the same 
or different HCV group (s) or subgroup (s) . Variable 
domains can be readily identified through sequence 
composition among isolates, and examples of these 

30 techniques are described below. For the purposes of 

describing the present invention, variable domains will 
be defined with respect to the amino acid number of the 
polyprotein encoded by the genome of HCV-i as shown in 
Figure 9, with the initiator methionine being designated 

35 position 1. The corresponding variable domain in another 
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HCV isolate is determined by aligning the two isolates 
sequences in a manner the brings the conserved domains 
outside any variable domain into maximum alignment. This 
can be performed with any of a number of computer 
5 software packages, such as ALIGN 1.0, available from the 
University of Virginia, Department of Biochemistry (Attn: 
Dr. William R. Pearson). See Pearson et al., (1988) 
Proc. Natl. Acad. Sci. USA £5:2444-2448 . It is to be 
understood that the amino acid numbers given for a 

10 particular variable domain are somewhat subjective and a 
matter of choice. Thus, the beginning and end of 
variable domains should be understood to be approximate 
and to include overlapping domains or subdomains, unless 
otherwise indicated. 

15 An epitope is the "immunologic equivalent" of 

another epitope in a designated polypeptide when it 
cross -reacts with antibodies which bind immunologically 
to the epitope in the designated polypeptide. 

Epitopes typically are mapped to comprise at 

20 least about five amino acids, sometimes at least about 8 
amino acids, and even about 10 or more amino acids. 

The amino acid sequence comprising the HCV 
epitope may be linked to another polypeptide (e.g., a 
carrier protein) , either by covalent attachment or by 

25 expressing a fused polynucleotide to form a fusion 

protein. If desired, one may insert or attach multiple 
repeats of the epitope, and/or incorporate a variety of 
epitopes. The carrier protein may be derived from any 
source, but will generally be a relatively large, 

30 innnunogenic protein such as BSA, KLH, or the like. If 
desired, one may employ a substantially full-length HCV 
protein as the carrier, multiplying the number of 
ixmnunogenic epitopes. Alternatively, the amino acid 
sequence from the HCV epitope may be linked at the amino 

35 terminus and/or carboxy terminus to a non-HCV amino acid 
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sequence, thus the polypeptide would be a "fusion 
polypeptide". Analogous types of polypeptides may be 
constructed using epitopes from other designated viral 
proteins . 

5 A "variant" of a designated polypeptide refers 

to a polypeptide in which the amino acid sequence of the 
designated polypeptide has been altered by the deletion, 
substitution, addition or rearrangement of one or more 
amino acids in the sequence. Methods by which variants 
10 occur (for example, by recombination) or are made (for 
example, by site directed mutagenesis) are lenown in the 
art. 

"Transformation" refers to the insertion of an 
exogenous polynucleotide into a host cell, irrespective 

15 of the method used for the insertion, for example, direct 
uptake, transduction (including viral infection) , f - 
mating or electroporation. The exogenous polynucleotide 
may be maintained as a non- integrated vector, for 
example, a plasmid or viral genome, or alternatively, may 

20 be integrated into the host genome. 

An "individual" refers to a vertebrate, 
particularly a member of a mammalian species, and 
includes but is not limited to rodents (e.g., mice, rats, 
hamsters, guinea pigs) , rabbits, goats, pigs, cattle, 

25 sheep, and primates (e.g., chimpanzees , Afri cam Green 
Monkeys, baboons , orangutans, and humans) . 

As used herein, "treatment" refers to any of 
(i) the prevention of infection or reinfection, as in a 
traditional vaccine, (ii) the reduction or elimination of 

30 symptoms, and (iii) the substantial or complete 

elimination of the virus. Treatment may be effected 
prophylactically (prior to infection) or therapeutically 
(following infection) . 

The term "effective amount" refers to an amount 

35 of epitope- bearing polypeptide sufficient to induce an 
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immunogenic response in the individual to which it is 
administered, or to otherwise detectably immunoreact in 
its intended system (e.g., immunoassay). Preferably, the 
effective amount is sufficient to effect treatment, as 
5 defined above. The exact amount necessary will vary from 
application. For vaccine applications or in the 
generation of polyclonal ant iserum/ antibodies , for 
example, the effective amount may vary depending on the 
species, age, and general condition of the individual, 
10 the severity of the condition being treated, the 
particular polypeptide selected and its mode of 
administration, etc. It is also believed that effective 
amounts will be found within a relatively large, non-' 
critical range. An appropriate effective amount can be 
15 readily determined using only routine experimentation. 

As used herein, a "biological sample" refers to 
a sample of tissue or fluid isolated from an individual , 
including but not limited to, for example, plasma, serum, 
spinal fluid, lymph fluid, the external sections of the 
20 sJcin, respiratory, intestinal, and genitourinary tracts, 
tears, saliva, milk, blood cells, tumors, organs, 
biopsies and also samples of in vitro cell culture 
constituents (including but not limited to conditioned 
medium resulting from the growth of cells in cell culture 
25 medium, e.g., Mab producing myeloma cells, recombinant 
cells, and cell components). 

The immunoreact ive polypeptide compositions of 
the present invention comprise a mixture of isolate- or 
group- specific epitopes from at least one HCV VD. Thus, 
30 there will be present at least two heterogeneous amino 

acid sequences each defining an epitope found in distinct 
HCV isolates located in the same or substantially same 
physical location in an HCV protein; i.e. each sequence 
maps to the same location within the HCV 
35 genome/polypeptide. Since the sequences are 
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. heterogeneous, the location is referred to as a variable 
domain (VD) . 

To better understand the invention, first the 
individual amino acid sequences that make up the 
5 compositions of the invention will be explained. Then 
the plurality of such sequences which are found in the 
compositions of the present invention will be discussed. 

The amino acid sequence that characterizes the 
polypeptides of the present invention have a basic 
10 structure as follows: 

VZ-L' y . (I) 
2 represents the amino acid sequence from a region of a 
protein from. a selected HCV isolate, where the region 
comprises at least one variable domain and the variable 

15 domain comprises at least one epitope. L and L' are 

non-HCV amino acid sequences or HCV amino acid sequences 
that do not contain a variable domain, and which can be 
the same or different, y and y' are 0 or 1 and can be 
the same or different. Thus, formula I represents an 

20 amino acid sequence comprising the sequence of an HCV VD, 
wherein the VD comprises an epitope. 

As discussed above, the epitope (s) in Z will 
usually comprise a minimum of about 5 amino acids, more 
typically a minimum of about 8 amino acids, and even more 

25 typically a minimum of about 10 amino acids. 

The variable domain of Z can comprise more than 
one epitope. The variable domain of Z is at least as big 
as the combined sequences of the epitopes present, thus 
making it typically a minimum of about 5 amino acids when 

30 a single epitope is present. Since epitopes can overlap, 
the minimum amino acid sequence for combined epitopes in 
the variable domain may be less than the sum of the 
individual epitopes' sequences. 

Z is the amino acid sequence of an HCV isolate 

35 comprising the above -described VD. Thus, the minimum 
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size of Z is the minimum size of the ;VD. z can comprise 
more HCV amino acid sequence than just the VD, and can 
further comprise more than one VD. The maximum size of Z 
is not critical, but obviously cannot exceed the length 
5 of the entire HCV polyprotein. Typically, however, Z 
will be the sequence of an entire HCV protein 
(particularly El, E2/NS1, NS2, NS3, NS4 and NS5) or, even 
more typically, a fragment of such an HCV protein. Thus, 
Z will preferably range from a minimum of about 5 amino 
10 acids (more preferably about 8 or about 10 amino acids 
minimum) to a maximum of about 1100 amino acids (more 
preferably a maximum of about 500, more preferably a 
maximum of about. 400 or even more preferably a maximum of 
about 200 amino acids maximum) . More usually, the 
15 polypeptide of formula I and/or Z, when prepared by, 

e.g., chemical synthesis, is a maximum of about 50 amino 
acids, more typically a maximum of about 4*0 amino acids, 
and even more typically a maximum of about 30 amino 
acids. 

20 The non-HCV amino acid sequences, L and L' , if 

present, can constitute any of a number types of such 
sequences. Por example, L and L' can represent non-HCV 
sequences to which Z is fused to facilitate recombinant 
expression (e.g., beta-galactosidaae, superoxide 

25 dismutase, invertaae, alpha-factor, TPA leader, etc.), as 
discussed below. Alternatively, L and L' can represent 
epitopes of other pathogens, such as hepatitis B virus, 
Bordetella pertussis, tetanus toxoid, diphtheria, etc., 
to provide compositions that are imnunoreactive relative 

30 to a number these other pathogens. L and L' can be amino 
acid sequences that facilitate attachment to solid 
supports during peptide synthesis, immunoassay supports, 
vaccine carrier proteins, etc. In fact, L and L' can 
even comprise one or more superfluous amino acids with no 

35 functional advantage. There is no critical maximum size 
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f or L or L' , the length being generally governed by the 
d sired function. Typically, L and L' will each be a 
maximum of about 2000 amino acids, more typically a 
maximum of about 1000 amino acids. The majority of L and 
5 L' sequences with useful properties will be a maximum of 
about 500 amino acids. It is desirable, of course, to 
select L and L' so as to not block the immunoreactivity 
of Z. 

The composition of polypeptides provided 

10 according to the present invention are characterized by 
the presence (in an effective amount for 
immunoreactivity) within the composition .of at least two 
amino acid sequences defined as follows by formulas II 
and III, respectively: 

15 VZ r L' y . (ID 

VZ 2 -L' y . (III) 
L, L' , y and y 4 are defined as above, as well as 
independently defined for each of formulas II and III. 
Z, and Zj are each HCV amino acid sequences as defined for 

20 z above encompassing the same variable domain (i.e., 
physical location) , but derived from different HCV 
isolates having between them at least one heterogeneous 
epitope in the common variable domain of Z, and Z 2 . As an 
illustrative example, an amino acid sequence according to 

25 formula II could have as Z, a fragment the hypervariable 
domain spanning amino acids 384-414 of isolate HCV-l (or 
more particularly 396-407 or 396-408), while Z, is the 
analogous fragment from isolate HCV-ji.i. These two 
isolates are heterogeneous in this domain, the amino acid 

3° sequences of the epitopes varying significantly. 

It is to be understood that the compositions of 
the present invention may comprise more than just two 
discrete amino acid sequences according to formula I, and 
that the Z sequences may be divided into groups 

3 5 encompassing different variable domains. For example, a 
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composition according to the present inv ntion could 
coxnpris a group of HCV sequences (with amino acid 
sequences according to formula I) encompassing the 
hypervariable domain at amino acids 3 84-411 from isolates 
5 HCV-1, HCV-J1.1, HC-Jl, HC-J4, etc. The composition 

could also comprise an additional group of HCV sequences 
(within amino acid sequences according to formula I) 
encompassing the variable domain at amino acids 215-25 5 
also from isolates HCV-1, HCV-J1.1, HC-Jl, HC-J4, etc. 
10 Within the context of the compositions of the present 
invention, therefore, the sequence of formula I can be 
further defined as follows: 

SV 0 (IV) 
V represents an amino acid sequence comprising the 
15 sequence of an HCV variable domain, wherein the variable 
domain comprises at least one epitope; i.e., formula I. 
S and n are integers of 1 or greater. S represents a 
particular variable domain, and n represents a particular 
isolate. For example, S-l could represent the variable 
20 domain at amino acids 384-411? S-2 could represent the 

variable domain at amino acids 215-255; and n-1, 2, 3 and 
4 could represent isolates HCV-l, HCV-J1.1, HC-Jl and HC- 
J4, respectively. Thus, the two groups of sequences 
discussed above could be represented by: 
25 Group 1: lV lf 1V„ 1V 3 & 1V 4 

Group 2: 2V, , 2V 5 , 2V, 6 2V 4 
There are at least two distinct sequences of 
formula IV in the compositions according to the present 
invention; i.e-, the composition contains two different 
30 sequences according to formula IV where the values for S 
and or n are different. Por example, at least IV, and iv, 
are present, or at least IV, and 2V, are present, or at 
least IV, and 2V, are present. 

The distinct sequences falling within formula 
35 rv are present in the composition either on the same or 



WO 93/06126 2116764 PCT/LS92/07683 

** 2 5 ~ 

different polypeptide molecules. Using the minimum 
combination of IV, and 1V 2 to illustrate, these two 
sequences could be present in the same polypeptide 
molecule (e.g., 1V,-1V 2 ) or in separate molecules- This 
5 feature of the compositions of the present invention can 
be described as compositions of polypeptides as follows: 

R, - ( S V, ) E - R ' r (V) 
wherein S, V and n are as defined above; R and R' are 
amino acid sequences of about 1-2000 amino acids, and are 

10 the same or different; r and r' are 0 or 1, and are the 
same or different; x is an integer > 1; n is 
independently selected for each x; and with the proviso 
that am ino acid sequences are present in the composition 
representing a combination selected from the group 

15 consisting of (i) IV, and lVj, (ii) IV, and 2V 2 , and (iii) 
IV! and 2V,. In embodiments where the distinct sequences 
of formula IV are in different polypeptides, x can be l, 
although it can still be >l if desired; e.g., a mixture 
of polypeptides 1V,-1V 2 and 1V,-2V 2 . When x is 1, r and 

20 r' are preferably both 0 to avoid redundancy with and 
L' y . # since V can be described by in a preferred 
embodiment by formula I. When x is >1, the combined 
lengths of R and the adjacent L, and of R' and the 
adjacent L' , are preferably no more than the typical 

25 maximum lengths described above for L and L' . 

The selection of the HCV amino acid sequences 
included within the distinct V sequences of the 
compositions will depend upon the intended application of 
the sequences and is within the skill of the art in view 

30 of the present disclosure* First, it should be 

appreciated that the HCV epitopes of concern to the 
present invention can be broken down into two types. The 
first type of epitopes are those that are • group - 
specific"; i.e., the corresponding epitopes in all or 

35 
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substantially all isolates within an HCV isolate group 
are immunologically cross- reactive with each other, but 
not with the corresponding epitopes of substantially all 
the isolates of another group. Preferably, the epitopes 
5 in a group- specif ic class are substantially conserved 
within the group, but not between or among the groups. 
The second type of epitopes are those that are " isolate - 
specific"; i.e., the epitope is immunologically cross- 
reactive with substantially identical isolates, and is 
10 not cross- reactive with all or substantially all distinct 
isolates. 

These group- and isolate-specif ic epitopes can 
be readily identified in view of the present disclosure. 
First, the sequences of several HCV isolates is compared, 
15 as described herein, and areas of sequence heterogeneity 
identified. The pattern of heterogeneity usually 
indicates group or isolate specificity. If an identified 
area is known to comprise one or more epitopes, then a 
sequence of sufficient size to include the desired 
20 epitope (s) is selected to as an variable domain that may 
be included in the compositions of the present invention. 
If the iimnunoreactivity of a given heterogeneous area is 
not known, peptides representing the sequences found in 
that area of the various HCV isolates can be prepared and 
25 screened- Screening can include, but is* not limited too, 
inraunoassays with various sources of anti-HCV antibody 
(e.g.* patient serum, neutralizing Mabs, etc.) or 
generation of antibody and testing the ability of such 
antibody to neutralize virus in vitro. Alternatively, 
30 the loci of epitopes identified in a screening protocol, 
euch as that described below, can be examined for 
heterogeneity among various isolates and the 
iromnological properties of corresponding heterogeneous 
sequences screened, 

35 
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For vaccine applications, it is believed that 
variable domains from the El and/or E2/NS1 domains will 
be of particular interest- In particular, an El variable 
domain within amino acids 215-255 (see Figure 2) , and an 
5 E2/NS1 variable domain within amino acids 384-414 (see 
Figure 3) , have been identified as being important 
ixnmunoreactive domains. The preliminary evidence 
suggests that one or both of these domains may be loci of 
heterogeneity responsible for escape mutants, leading to 

10 chronic HCV infections. Thus, polypeptide compositions 
as described above where the variable domain (s) in V are 
one or both of these variable domains are particularly 
preferred. Furthermore, the polypeptide compositions of 
the present invention, while particularly concerned with 

15 the generally linear epitopes in the variable domains, 
may also include conformational epitopes. For example, 
the composition can be comprised of a mixture of 
recombinant El and/or E2/NS1 proteins (exhibiting the 
variable domains of different isolates) expressed in a 

20 recombinant system (e.g., insect or mammalian cells) that 
maintains conformational epitopes either inside or 
outside the variable domain. Alternatively, an El and/or 
E2/NS1 subunit antigen from a single isolate that 
maintains conformational epitopes can be combined with a 

25 polypeptide composition according to the present 

invention (e.g., a mixture of synthetic polypeptides or 
denatured recombinant polypeptides) . In another 
preferred application for vaccines, the polypeptide 
compositions described herein are combined with other HCV 

30 subunit antigens 
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For diagnostic application, it may be useful to 
employ the compositions of the present invention as 
antigens, thereby improving the ability to detect 
antibody to distinct HCV isolates. Typically the 
5 polypeptide mixtures can used directly in a homogeneous 
or heterogeneous immunoassay format, the latter 
preferably comprising immobilizing the polypeptide on a 
solid substrate (e.g., microtiter plate wells, plastic 
beads, nitrocellulose, etc.). See, e.g.. PCT Pub. No. 
10 WO90/11089; EPO Pub. No. 360.088; IMMUNOASSAY: A 
PRACTICAL GUIDE , supra. Alternatively, each 
substantially identical polypeptide that makes up the 
polypeptide composition of the present invention could be 
immobilized on the same support at discrete loci, thereby 
15 providing information as to which isolate or group the 
antibody has been generated. This may be particularly 
important in diagnostics if various isolates cause 
hepatitis, cancer or other diseases with different 
clinical prognoses. A preferred format is the Chiron 
20 RIBA™ strip immunoassay format, described in commonly 
owned U.S. S.N. 07/138.894 and U.S. S.N. 07/456.637, the 
disclosures of which are incorporated herein by 
reference. 

Polypeptides useful in the manufacture of the 
25 compositions of the present invention can be made 
recombinantly, synthetically or in tissue culture. 
Recombinant polypeptides comprised of the truncated HCV 
sequences or full-length HCV proteins can be made nap 
entirely of HCV sequences (one or more epitopes, either 
30 contiguous or noncontiguous) , or sequences in a fusion 
protein. In fusion proteins, useful heterologous 
sequences include sequences that provide for secretion 
from a recombinant host, enhance the immunological 
reactivity of the HCV epitope ( s) . or facilitate the 
35 coupling of the polypeptide to a support or a vaccine 
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carrier. See, e.g., EPO Pub. No. 116,201; U.S. Pat. No. 

4,722840; EPO Pub. No. 259,149; U.S. Pat. No. 4,629,783. 

Pull length as well as polypeptides comprised 
of truncated HCV sequences, and mutants thereof, may be 
prepared by chemical synthesis- Methods of preparing 
polypeptides by chemical synthesis are known in the art. 
They may also be prepared by recombinant technology. A 
DNA sequence encoding HCV-1, as well as DNA sequences of 
variable regions from other HCV isolates have been 
described and/or referenced herein. The availability of 
these sequences permits the construction of 
polynucleotides encoding immunoreactive regions of HCV 

polypeptides . 

Polynucleotides encoding the desired 
polypeptide comprised of one or more of the 
immunoreactive HCV epitope from a variable domain of HCV 
may be chemically synthesized or isolated, and inserted 
into an expression vector. The vectors may or may not 
contain portions of fusion sequences such as beta- 
Galactosidase or superoxide dismutase (SOD). Methods and 
vectors which are useful for the production of 
polypeptides which contain fusion sequences of SOD are 
described in European Patent Office Publication number 
0196056, published October 1, 1986. 

The DNA encoding the desired polypeptide, 
whether in fused or mature form and whether or not 
containing a signal sequence to permit secretion, may be 
ligated into expression vectors suitable for any 
convenient host- The hosts are then transformed with the 
expression vector. Both eukaryotic and prokaryotic host 
systems are presently used in forming recombinant 
polypeptides, and a summary of some of the more common 
control systems and host cell lines is presented infra. 
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The host cells are incubated under conditions which allow 
expression of the desired polypeptide. The polypeptide 
is then isolated from lysed cells or from the culture 
medium and purified to the extent needed for its intended 
5 use . 

The general techniques used in extracting the 
HCV genoine from a virus, preparing and probing DNA 
libraries, sequencing clones, constructing expression 
vectors, transforming cells, performing immunological 

10 assays such as radioimmunoassays and ELISA assays, for 

growing cells in culture, and the like, are known in the 
art. (See, e.g., the references cited in the "Background" 
section, above, as well as the references cited at the 
beginning of this ("Modes of Practicing the Invention 1 ^ 

15 section above. 

Transformation of the vector containing the 
desired sequence into the appropriate host may be by any 
known method for introducing polynucleotides into a host 
cell, including, for example, packaging the 

20 polynucleotide in a virus and transducing the host cell 
with the virus, or by direct uptake of the 
polynucleotide. The transformation procedure used 
depends upon the host to be transformed. Bacterial 
transformation by direct uptake generally employs 

25 treatment with calcium or rubidium chloride (Cohen 
(1972), Proc. Natl. Acad. Sci. USA £2.: 2110 • Yeast 
transformation by direct uptake may be carried out using 
^ the method of Hinnen et al. (1978), J. Adv. Enzyme 

Reg. 2: 1929. Maxrmalian transformations by direct uptake 

30 may be conducted using the calcium phosphate 

precipitation method of Graham and Van der Eb (1978) , 
Virology 52:546, or the various known modifications 
thereof. Other methods for the introduction of 
recombinant polynucleotides into cells, particularly into 
35 mamnalian cells, which are known in the art include 
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dextran mediated transf ection, calcium phosphate mediated 
transf ection, polybrene mediated transf ection, protoplast 
fusion, electroporation, encapsulation of the 
polynucleotide (s) in liposomes, and direct microinjection 
5 of the polynucleotides into nuclei. 

In order to obtain expression of desired coding 
sequences, host cells are transformed with 
polynucleotides (which may be expression vectors) # which 
are comprised of control sequences operably linked to the 

10 desired coding sequences- The control sequences are 

compatible with the designated host. Among prokaryotic 
hosts, B. coli is most frequently used. Expression 
control sequences for prokaryotes include promoters, 
optionally containing operator portions, and ribosome 

15 binding sites. Transfer vectors compatible with 

prokaryotic hosts are commonly derived from, for example, 
pBR322, a plasmid containing operons conferring 
axnpicillin and tetracycline resistance, and the various 
pUC vectors, which also contain sequences conferring 

20 antibiotic resistance markers. Promoter sequences may be 
naturally occurring, for example, the £- lactamase 
(penicillinase) (Weissman (1981), •The cloning of 
interferon and other mistakes" in interferon 3 (ed. I. 
Gresser) , lactose (lac) (Chang et al. (1977), Nature 

25 19B : 1056) and tryptophan (trp) (Goeddel et al. (1980), 

Nucl. Acids Res- 1:4057), and lambda -derived P u promoter 
system and N gene ribosome binding site (Shixnatake et al . 
(1981), Nature 212:128). In addition, synthetic 
promoters which do not occur in nature also function as 

30 bacterial promoters. For example, transcription 

activation sequences of one promoter may be joined with 
the operon sequences of another promoter, creating a 
synthetic hybrid promoter (e.g., the tac promoter, which 
is derived from sequences of the trp and Jac promoters 

35 (De Boer et al. (1983), Proc. Natl. Acad. Sci. USA 
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fi£:21) - The foregoing systems are particularly 
compatible with £. ££li; if desired, other prokaryotic 
hosts such as strains of Bacillus or Pseudomonas may be 
used, with corresponding control sequences. 
5 Eukaryotic hosts include yeast and maxrcnalian 

cells in culture systems. Sacchar^ny^ a cere yj g j a ^ and 
SflCChaCQmYCfff carlaberaengig are the most commonly used 
yeast hosts, and are convenient fungal hosts. Yeast 
compatible vectors generally carry markers which permit 
selection of successful transf ormants by conferring 
prototropy to auxotrophic mutants or resistance to heavy 
metals on wild- type strains. Yeast compatible vectors 
may employ the 2 micron origin of replication (Broach et 
al. (1983), Meth. Enz. JLQA:307) , the combination of CEN3 
15 and ARSl or other means for assuring replication, such as 
sequences which will result in incorporation of an 
appropriate fragment into the host cell genome. Control 
sequences for yeast vectors are known in the art and 
include promoters for the synthesis of glycolytic enzymes 
20 (Hess et al. (1968), J. Adv. Enzyme Reg. 2:149); for 

example, alcohol dehydrogenase (ADH) (E. P.O. Publication 
No, 284044), enolase, glucokinase, glucose- 6 -phosphate 
isomerase, glyceraldehyde- 3 -phosphate dehydrogenase (GAP 
or GAPDH) , hexokinase, phosphof ructokinase, 3- 
25 glycerophosphate mutase, and pyruvate kinase (PyK) (E.P.O. 
Publication No. 329203). The yeast PH05 gene, encoding 
acid phosphatase, also provides useful promoter 
sequences. In addition, synthetic promoters which do not 
occur in nature also function as yeast promoters. For 
30 example, upstream activating sequences (UAS) of one yeast 
promoter may be joined with the transcription activation 
region of another yeast promoter, creating a synthetic 
hybrid promoter. Examples of such hybrid promoters 
include the ADH regulatory sequence linked to the GAP 
35 transcription activation region (U.S. Patent Nos. 
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4,876,197 and 4,880,734). Other examples of hybrid 
promoters include promoters which consist of the 
r gulatory sequences of either the ADH2 , GAL4 , GAL10, or 
PH05 genes, combined with the transcriptional activation 
5 region of a glycolytic enzyme gene such as GAP or PyK 
(E.P.O. Publication No . 164556). Furthermore, a yeast 
promoter can include naturally occurring promoters of 
non-yeast origin that have the ability to bind yeast ska 
polymerase for the appropriate initiation of 
10 transcription. 

Other control elements which may be included in 
the yeast expression vector are terminators (e.g., from 
GAPDH, and from the enolase gene (Holland (1981), J. 
Biol. Chem. 2££:1385), and leader sequences. The leader 
15 sequence fragment typically encodes a signal peptide 
comprised of hydrophobic amino acids which direct the 
secretion of the protein from the cell. DNA encoding 
suitable signal sequences can be derived from genes for 
secreted yeast proteins, such as the yeast invertase gene 
(E.P.O. Publication No. 12,873) and the a- factor gene 
(U.S. Patent No. 4,588,684). Alternatively, leaders of 
non-yeast origin, such as an interferon leader, also 
provide for secretion in yeast (E.P.O. Publication No. 
60057) . A preferred class of secretion leaders are those 
25 that employ a fragment of the yeast a- factor gene, which 
contains both a "pre" signal sequence, and a "pro" 
region. The types of a- factor fragments that can be 
enqployed include the full-length pre -pro a- factor leader, 
as well as truncated or-factor leaders (U.S. Patent Nos. 
30 4,546,083 and 4,870,008; E.P.O. Publication No. 324274. 
Additional leaders employing an a- factor leader fragment 
that provides for secretion include hybrid cr- factor 
leaders made with a pre-aequence of a first yeast, but a 
pro- region from a second yeast a-factor. (See, e.g., 
35 P.C.T. WO 89/02463) . 



20 
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Expression vectors, either extrachromosomal 
replicons or integrating vectors, have been developed for 
transformation into many yeasts. For example, expression 
vectors have been developed for Candida albicans (Kurtz 
5 et al. (198€), Mol . Cell Biol.£:142), Candida maltosa 
(Kunze et al . (1985) J. Basic Microbiol. 2£:141), 
HanzeilUla polymorph* (Gleeson et al . (1986), J. Gen. 
Microbiol. 122:3459), KlUYYegQmvcgfl fraaili* (Das et al . 
(1984), J. Bacterid. 15£:1165), Kluwgromyr^ lactis (De 
10 Louvencourt et al. (1983), J. Bacterid. 1£±:737), Pichia 
gaillfiZimoaaii. (Kunze et al. (1985), supra), Pichia 
PftgtQriS (Cregg et al. (1985), Mol. Cell. Biol. 5:3376; 
U.S. Patent Nos. 4,837,148 and 4,929,555)), 
SchizosaccharomvcM EfimbS (Beach and Nurse (1961), Nature 
15 2H£:706), and YarrQwja lipolvtica (Davidow et al . (1985), 
Curr. Genet. 1£:39) . 

Mammalian cell lines available as hosts for 
expression are known in the art and include many 
immortalized cell lines available from the American Type 
20 Culture Collection (ATCC) , including, for example, HeLa 
cells, Chinese hamster ovary (CHO) cells, baby hamster 
kidney (BHK) cells, COS monkey cells, and a number of 
other cell lines. Suitable promoters for mammalian cells 
are also known in the art and include viral promoters 
25 such as that from Simian Virus 40 (SV40) , Rous sarcoma 

virus (RSV) , adenovirus (ADV) and bovine papilloma virus 
(BPV) (See, Sambrook (1989) for examples of suitable 
promoters) . Mammalian cells may also require terminator 
sequences and poly A addition sequences? enhancer 
30 sequences which increase expression may also be included, 
and sequences which cause amplif ication of the gene may 
also be desirable. These sequences are known in the art. 

Vectors suitable for replication in mammalian 
cells are known in the art, and may include viral 
35 replicons, or sequences which ensure integration of the 
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appropriate sequences encoding the desired polypeptides 
into the host genome. 

A v ctor which is us d to express foreign DNA 
and which may be used in vaccine preparation is Vaccinia 
5 virus. In this case, the heterologous DNA is inserted 
into the Vaccinia genome. Techniques for the insertion 
of foreign DNA into the vaccinia virus genome are known 
in the art. and utilize, for example, homologous 
recombination. The insertion of the heterologous DNA is 

10 generally into a gene which is non-essential in nature, 
for example, the thymidine kinase gene (tic) , which also 
provides a selectable marker. Plasmid vectors that 
greatly facilitate the construction of recombinant 
viruses have been described (see, for example, Mackett et 

15 al. (1984) in "DNA Cloning", Vol. II. IRL Press, p. 191, 
Chakrabarti et al. (1985), Mol . Cell Biol. fi:3403; Moss 
(1987) in "Gene Transfer Vectors for Mammalian Cells" 
(Miller and Calos, eds. , p. 10) . Expression of the 
desired polypeptides comprised of immunoreactive regions 

20 then occurs in cells or individuals which are infected 
and/or immunized with the live recombinant vaccinia 
virus . 

Other systems for expression of polypeptides 
include insect cells and vectors suitable for use in 

25 these cells. These systems are known in the art, and 

include, for example, insect expression transfer vectors 
derived from the baculovirus AUtpgrapfrtt CftliCQmica 
nuclear polyhedrosis virus (AcNFV) , which is a helper- 
independent, viral expression vector. Expression vectors 

30 derived from this system usually use the strong viral 
polyhedron gene promoter to drive expression of 
heterologous genes. Currently the most commonly used 
transfer vector for introducing foreign genes into AcNPV 
is pAc373. Many other vectors, known to those of skill 

35 in the art, have also been designed for improved 
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expression. These include, for example, pVL985 (which 
alters the polyhedron start codon from ATG to ATT, and 
which introduces a BamHI cloning site 32 basepairs 
downstream from the ATT; See Luckow and Summers (1989), 
5 Virology 17:31. Good expression of nonfused foreign 

proteins usually requires foreign genes that ideally have 
a short leader sequence containing suitable translation 
initiation signals preceding an ATG start signal. The 
plasmid also contains the polyhedron polyadenylation 
10 signal and the ampicillin- resistance (amp) gene and 

origin of replication for selection and propagation in £. 
coli . 

Methods for the introduction of heterologous 
DNA into the desired site in the baculovirus are known in 

15 the art. (See Summers and Smith, Texas Agricultural 

Experiment Station Bulletin No. 1555; Ju et al . (1987), 
in "Gene Transfer Vectors for Mammalian Cells (Miller and 
Calos, eds.); Smith et al. (1983), Mol. & Cell. Biol. 
2:2156; and Luckow and Summers (1989), supra). For 

20 example, the insertion can be into a gene such as the 

polyhedron gene, by homologous recombination; insertion 
can also be into a restriction enzyme site engineered 
into the desired baculovirus gene. The inserted 
sequences may be those which encode all or varying 

25 segments of the desired HCV polypeptides including at 
least one epitope from a variable domain. 

The signals for posttranslational 
modifications, such as signal peptide cleavage, 
proteolytic cleavage, and phosphorylation, appear to be 

30 recognized by insect cells. The signals required for 
secretion and nuclear accumulation also appear to be 
conserved between the invertebrate and vertebrate cells. 
Examples of the signal sequences from vertebrate cells 
which are effective in invertebrate cells are known in 

35 the art, for example, the human interleukin 2 signal 
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(IL2.) which is a signal for transport out if the cell, 
is recogniz d and properly removed in insect cells. 

It is often desirable that the polyp ptides 
prepared using the above host cells and vectors be fusion 
5 polypeptides. As with non- fusion polypeptides, fusion 
polypeptides may remain intracellular after expression. 
Alternatively, fusion proteins can also be secreted from 
the cell into the growth medium if they are comprised of 
a leader sequence fragment. Preferably, there are 

10 processing sites between the leader fragment and the 

remainder of the foreign gene that can be cleaved either 
in vivo or in vitro . 

In cases where the composition is to be used 
for treatment of HCV, it is desirable that the 

15 composition be immunogenic. In instances wherein the 

synthesized polypeptide is correctly configured so as to 
provide the correct epitope, but is too small to be 
immunogenic, the polypeptide may be linked to a suitable 
carrier. A number of techniques for obtaining such 

20 linkage are known in the art, including the formation of 
disulfide linkages using N-succinimidyl-3 - (2-pyridyl - 
thio) propionate (SPDP) and succinimidyl 4- (N- 
maleimidomethyl) cyclohexane- l-carboxylate (SMCC) (if the 
peptide lacks a sulfhydryl group, this can be provided by 

25 addition of a cysteine residue.) These reagents create a 
disulfide linkage between themselves and peptide cysteine 
resides on one protein and am amide linkage through the 
6- amino on a lysine, or other free amino group in other 
amino acids. A variety of such diaulf ide/amide- forming 

30 agents are known. See, for example, Immun. Rev. (1962) 
£2:185. Other bifunctional coupling agents for a 
thioether rather than a disulfide linkage. Many of these 
thio- ether- forming agents are commercially available and 
include reactive esters of 6-maleimidocaproic acid, 2- 

35 bromoacetic acid, 2-iodoacetic acid, 4- (N-maleimido- 
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methyl) cyclohexane- l-carboxylic acid, and the like. The 
carboxyl groups can be activated by combining them with 
succinimide or 1-hydroxyl- 2 -nitro-4- sulfonic acid, sodium 
salt. Additional methods of coupling antigens employ the 
5 rotavirus/ "binding peptide" system described in EPO 

Publication No. 259,149. The foregoing list is not meant 
to be exhaustive, and modifications of the named 
compounds can clearly be used. 

Any carrier may be used which does not itself 

10 induce the production of antibodies harmful to the host. 
Suitable carriers are typically large, slowly metabolized 
macromolecules such as proteins; polysaccharides such as 
latex functional ized sepharose, agarose, cellulose, 
cellulose beads and the like; polymeric amino acids, such 

15 as poiyglutamic acid, polylysine, and the like; amino 
acid copolymers; and inactive virus particles (see 
infra.). Especially useful protein substrates are serum 
albumins, keyhole limpet hemocyanin , immunoglobulin 
molecules, thyroglobulin, ovalbumin, tetanus toxoid, and 

20 other proteins well known to those of skill in the art. 

The immunogenicity of the epitopes of the HCV 
variable domains, particularly of El and E2/NS1, may also 
be enhanced by preparing them in eukaryotic systems fused 
with or assembled with particle- forming proteins such as, 

2 5 for example, that associated with hepatitis B surface 

antigen. See, e.g., U.S. Patent No. 4,722,840. 
Constructs wherein the polypeptide containing the HCV 
epitope from a variable domain is linked directly to the 
particle- forming protein coding sequences produces 

3 0 hybrids which are immunogenic with respect to the HCV 

epitope. In addition, all of the vectors prepared 
include epitopes specific to HBV, having various degrees 
of immunogenicity, such as, for example, the pre-S 
peptide. Thus, particles constructed from particle 

35 
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forming protein which include HCV sequences are 
immunogenic with respect to HCV and HBV. 



Hepatitis surface antigen (HBSAg) has been 



shown to be formed and assembled into particles in £. 



5 




'visiag (Valenzuela et al. (1982) , Nature 22fi:344, as 



well as in, for example, mammalian cells (Valenzuela et 
al. (1984), in "Hepatitis B", Millman I. et al., ed.). 
The formation of such particles has been shown to enhance 
the immunogenicity of the monomer subunit. The 

10 constructs may also include the immunodominant epitope of 
HBSAg, comprising the 55 amino acids of the presurface 
(pre-S) region. Neurath et al. (1984). Constructs of 
the pre- S- HBSAg particle expressible in yeast are 
disclosed in E.P.O. Publication No. 174,444; hybrids 

15 including heterologous viral sequences for yeast 
expression are disclosed in E.P.O. Publication No. 
175,261. These constructs may also be expressed in 
mammalian cells such as CHO cells using an SV40- 
dihydrofolate reductase vector (Michelle et al. (1984)) . 

20 In addition, portions of the particle -forming 

protein coding sequence may be replaced with codons 
encoding an epitope from an HCV variable domain. In this 
replacement, regions which are not required to mediate 
the aggregation of the units to form immunogenic 

25 particles in yeast or mammals can be deleted, thus 
eliminating additional HBV antigenic sites from 
competition with the HCV epitope (s) . 



30 known to one skilled in the art. Typically, such 

vaccines are prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for 
solution in, or suspension in, liquid prior to injection 
may also be prepared, the preparation may also be 

35 emulsified, or the polypeptide (s) encapsulated in 



The preparation of vaccines which contain an 
imnunogenic polypeptide (s) as an active ingredient ( s ) is 
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liposomes. Th active immunogenic ingredients are often 
mixed with excipients which ar pharmaceutical!/ 
acceptable and compatible with the active ingredient. 
Suitable excipients are, for example, water, saline, 
5 dextrose, glycerol, e.thanol, or the like and combinations 
thereof. In addition, if desired, the vaccine may 
contain -minor amounts of auxiliary substances such as 
wetting or emulsifying agents, pH buffering agents, 
and/or adjuvants which enhance the effectiveness of the 
10 vaccine. Examples of adjuvants which may be effective 
include, but are not limited to: aluminum hydroxide, N- 
acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP) , N- 
acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637), 
referred to as nor-MDP) , N-acetylmuramyl-L-alanyl-D- 
15 isoglutaminyl-L- alanine- 2- (l' -2' -dipalmitoyl-sn-glycero- 
3 -hydroxyphosphoryloxy) -ethyl amine (CGP 19835A, referred 
to as MTP - PE , and RIBI, which contains three components 
extracted from bacteria, monophosphoryl lipid A, 
trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) 
20 in a 2% squalene/Tween 80 emulsion. The effectiveness of 
an adjuvant may be determined by measuring the amount of 
antibodies directed against an immunogenic polypeptide 
containing an HCV epitope from a variable domain, the 
antibodies resulting from administration of this 
25 polypeptide in vaccines which are also comprised of the 
various adjuvants. 

The proteins may be formulated into the vaccine 
as neutral or salt forms. Phannaceutically acceptable 
salts include the acid addition salts (formed with free 
30 amino groups of the peptide) and which are formed with 
inorganic acids such as, for example, hydrochloric or 
phosphoric acids, or organic acids such as acetic, 
oxalic, tartaric, maleic, and the like. Salts formed 
with the free carboxyl groups may also be derived from 
35 inorganic bases such as, for example, sodium, potassium, 



ammonium, calcium, or ferric hydroxides, and such organic 
bases as isopropylamine, trimethylamine, 2 -ethylamino 
ethanol, histidine, procaine, and the like. 

The vaccines are conventionally administered 
5 parenterally, by injection, for example, either 
subcutaneous ly or intramuscularly. Additional 
formulations which are suitable for other modes of 
administration include suppositories and, in some cases, 
oral formulations. For suppositories, traditional 

10 binders and carriers may include, for example, 

polyalkylene glycols or triglycerides ; such suppositories 
may be formed from mixtures containing the active 
ingredient in the range of 0.5% to 10%, preferably l%-2%. 
Oral formulations include such normally employed 

15 excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate, and the like. 
These compositions take the form of solutions, 
suspensions, tablets, pills, capsules, sustained release 

20 formulations or powders and contain l0%-95% of active 
ingredient, preferably 25% -70%. 

In addition to the above, it is also possible 
to prepare live vaccines of attenuated microorganisms 
which express recombinant polypeptides of the HCV antigen 

25 sets. Suitable attenuated microorganisms are known in 

the art and include, for example, viruses (e.g., vaccinia 
virus) as well as bacteria. 

The vaccines are administered in a manner 
compatible with the dosage formulation, and in such 

30 amount as will be prophylactically and/or therapeutically 
effective. The quantity to be administered, which is 
generally in the range of 5 fig to 250 p9 of antigen per 
dose, depends on the subject to be treated, capacity of 
the subject's immune system to synthesize antibodies, and 

35 the degree of protection desired. Precise amounts of 
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active ingredient required to be administered may depend 
on the judgment of the practitioner and may be peculiar 
to each individual. 

The vaccine may be given in a single dose 
5 schedule, or preferably in a multiple dose schedule. A 
multiple dose schedule is one in which a primary course 
of vaccination may be with 1-10 separate doses, followed 
by other doses given at subsequent time intervals 
required to maintain and/or reenforce the immune 

10 response, for example, at 1-4 months for a second dose, 

and if needed, a subsequent dose(s) after several months. 
The dosage regimen will also, at lest in part, be 
determined by the need of the individual and be dependent 
upon the judgment of the practitioner. 

15 In addition, the vaccine containing the antigen 

sets comprised of HCV polypeptides described above, may 
be administered in conjunction with other 
immunoregulatory agents, for example, immune globulins. 

The compositions of the present invention can 

20 be administered to individuals to generate polyclonal 
antibodies (purified or isolated from serum using 
conventional techniques) which can then be used in a 
number of applications. For example, the polyclonal 
antibodies can be used to passively immunize an 

25 individual, or as immunochemical reagents. 

In another embodiment of the invention, the 
above -described inmino reactive compositions comprised of 
a plurality of HCV antigen sets are used to detect 
ant i- HCV antibodies within biological samples, including 

30 for example, blood or serum samples. Design of the 

inmunoassays is subject to a great deal of variation, and 
a variety of these are known in the art. However, the 
immunoassay will use antigen sets wherein each antigen 
set consists of a plurality of substantially identical 

35 polypeptides comprising the amino acid sequence of an 
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epitope within a first variable domain of an HCV isolate, 
and the amino acid segu nee of one set is heterogeneous 
with respect to th amino acid sequence of at least one 
other set. Protocols for the immunoassay may be based, 
5 for example/ upon competition, or direct reaction, or 
sandwich type assays. Protocols may also, for example, 
use solid supports, or may be by immunoprecipitation. 
Most assays involve the use of labeled antibody or 
polypeptide; the labels may be, for example, fluorescent, 

10 chemiluminescent , radioactive, or dye molecules. Assays 
which amplify the signals from the probe are also known; 
examples of which are assays which utilize biotin and 
avidin, and enzyme- labeled and mediated immunoassays, 
such as ELXSA assays. 

15 Kits suitable for immunodiagnosis and contain- 

ing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the 
compositions of the invention containing HCV epitopes 
from variable domains, in suitable containers, along with 

20 the remaining reagents and materials (for example, 

suitable buffers, salt solutions, etc) required for the 
conduct of the assay, as well as a suitable set of assay 
instructions. 

Described below are examples of the present 

25 invention which are provided only for illustrative 
purposes, and not to limit the scope of the present 
invention. In light of the present disclosure, numerous 
embodiments within the scope of the claims will be appar- 
ent to those of ordinary skill in the art. 



35 
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Examples 

In the Examples the following materials and 
methods were used. 

Patient Samples and RNA Extraction 
5 Asymptomatic HCV carriers HCT 18 and HCV Jl and 

chronically infected HCV patient Th have been previously 
described in Weiner et al . (1991) Virol . 180:842-848 . 
Patient Q was diagnosed with chronic active hepatitis 
based on a liver biopsy and was placed on alfa-2b 

10 interferon therapy (3 million units, thrice weekly) for 
six months. RNA from 0.2 ml of plasma was extracted 
according to the method of Chomcynski and Sacchi, (1987) 
Anal. Biochem. 1£2: 156-159, using RNAzol™ B reagent 
(Cinna/Biotecx Laboratories) containing 10 /ig/ml MS2 

15 carrier RNA (Boehringer Mannheim, 165*948) as indicated 
by the manufacturer. RNA was resuspended in 200 pi of 
diethyl pyrocarbonate treated distilled water and 
reprecipitated in a final concentration of 0.2M sodium 
acetate and two and one half volumes of 100% ethanol 

20 (-20*0. 

cDNA and Polymerase Chain Reactions 

All reactions were performed according to 
Weiner et al. (1990) Lancet ii5:l-5. M13 sequencing was 
25 performed according to Messing et al. (1983), Methods in 
Enzymology 101:20-37. The consensus sequence of at least 
four cloned inserts are presented with the exception of 
thm HCV Jl.2 E2/NS1 sequence which was derived from two 
clones . 

30 Cloning and sequencing of HCT 18 and Th was as 

reported in Weiner et al. (1991), supra. Nested PCR 
primers used to clone the amino terminal and carboxy 
proximal segments of E2/NS1 in patient Q were: 
PCR I 

35 X(E2)14 GGTGCTCACTGGGGAGTCCT(1367-13 86)S 
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X(E2) 18J CATTGCAGTTCAGGGCCGTGCTA (1608-1588)A, 

PCR II 

X(E2)4 TCCATGGTGGGGAACTGGGC ( 1406- 1425 )S 
X(E2)19J TGCCAACTGCCATTGGTGTT(15a2-1562)A; 

5 PCR I 

X(E2)14 (above) S 
' Jlrcl2 TAACGGGCTGAGCTCGGA(2313-2296)A 

PCR II 

US(E2)5 CAATTGGTTCGGTTGTACC (i960 - 1978 ) S 
10 Jlrcl3 CGTCCAGTTGCAGGCAGCTTC ( 226 0-2240) A. 

PCR primers used to clone the HCV Jl E2/NS1 gene were: 
PCR I 

J1(E2)14 (above) S 

Jl (E2) rc30~ CAGGGCAGTATCTGCCACTC (2349 -2330) A 
15 J1IZ-2* TGAGACGGACGTGCTGCTCCT (I960- 1978 )S 

Jl ( E2 ) r C3 2 " TTTGATGTACCAGGCGGCGCA (2658-2636)A 
PCR II-E2384.5* 

GGATCCGCTAGCCATACCCGCGTGACGGGGGGGGTGCAA (1469- 
1495)S 

20 DSC0N1JBX* 

GGATCCTCTAGATTACTCITCTGACCTATCCCTGTCCTCCAAGTC 
ACA(2272-2301) A 

J1I2-1" CAACTGGTTCGGCTGTACA( 1915- 1935 )S 

Jl(E2)rc31~ (2566-2546)A. 
25 nt sequence from TaJceuchi et al., (1990) Nucl. Acids 

Res. 18:4626; **, nt sequence from Kato et al., (1989) 
Proc. Jpn. Acad. £5fl: 219-223. Sense (S) or antisense (A) 
PCR primers are given in the 5' to 3' orientation 
according nucleotide numbers in reference. 

30 

Syneheala of Biotinvlacad Pepcidea 
The overlapping octapeptides for the 
hypervariable regions of three strains of HCV were 
synthesized on cleavable- linker, derivatized. 

35 
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poly thylene pins essentially as described by (Maeji et 
al., (1990) J. Immunol- Methods 114:23-33, was coupled to 
the N- terminus of each peptide. Finally, biotin was 
coupled to the N- terminus using 150 pi of a 
5 dimethylf ormamide solution containing 40 xnM biotin, 4 0 mM 
1-hydroxybenzotriazole (HOBt) , 40 mM 
benzo t r iazole - 1 - yl - oxy - t ris - pyrrl idino - phosphonium 
hexaf luorophosphate (PyBOP, NOVABIOCHEM) and 60 mM 
N-methylmorpholine (NMM) reacting overnight at 20 # C. 
10 After biotinylation, the peptides were 

side- chain deprotected, washed and. the peptide from each 
pin was cleaved in 200 pi of 0.1M phosphate buffer (pH 
7.2) . Microtitre plates containing the cleaved peptide 
solutions were stored at -20*C until needed. 

15 

EIjTSA Testing of Biotinvlated Peptides 
Polystyrene plates (Nunc immuno plate xnaxisorb 
F96) were coated with streptavidin by incubating 
overnight at 4°C with 0-1 ml/well of a 5 pg/ml solution 

20 of streptavidin (Sigma Cat. No. S4762) in 0.1 M carbonate 
buffer at pH 9.6. After removal of the streptavidin 
solution, the wells were washed four times with a 0.1% 
solution of Tween 20*in PBS. Nonspecific binding was 
blocked by incubating each well with 0.2 ml of 2% BSA in 

25 PBS for 1 h at 20°C. The wells were again washed four 
times with PBS/Tween 20. Plates were air-dried and 
stored at 4 P C until required. The streptavidin in each 
well was coupled to cleaved peptides by incubation with 
100 Ml of a 1:100 dilution of cleaved peptide solution 

30 with 0.1% BSA in PBS containing 0.1% sodium azide for 1 h 
at 20°C. After incubation, the plate was washed four 
times with PBS/Tween 20*. Bach well was incubated with 
100 pi of a suitable dilution of serum (diluted with 2% 
BSA in PBS containing 0.1% sodium azide) for 1 h at 20 °C 

35 or overnight at 4°C followed by four washes with 

* trademark 
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PBS/Tween 20* Bound antibody was detected by reaction 
for l h at 20°C in 0 . l ml conjugate. This consisted of 
0.25 ml/1 (a saturating level) of horseradish peroxidase - 
labeled goat anti-rabbit IgG (H+L) (Kirkegaard and Perry 
5 Labs, Gaithersburg, MD) in CASS (0.1% sheep serum, 0.1% 
Tween 20* 0.1% sodium caseinate diluted in 0.1M PBS , pH 
7.2). The wells were washed 2 times with PBS/Tween 20* 
followed by two washes with PBS only. The presence of 
enzyme was detected by reaction for 45 min at 20 "C with 

10 0.1ml of a freshly -prepared solution containing 50 mg of 
ammonium 2,2' -azino-bis [3 -ethylbenzothiazoline- 
6-sulphonate (ABTS, Boehringer Mannheim Cat. no. 122661) 
and 0.03 ml of 35% (w/w) hydrogen peroxide solution in 
100 ml of 0.1 M phosphate/0. 08 M citrate buffer, pH 4.0. 

15 Color development was measured in a Titertek*Multiscan MC 
plate reader in the dual wavelength mode at 4 05 nm 
against a reference wavelength of 492 nm. 

Computer Generated Antigenicity Profile 

20 Antigenicity profiles for the HCV E2/NS1 

protein and HXV-l gp!20 hypervariable region V3 (aa 303- 
338) were derived from a computer program based on the 
degree of sequence variability as originally proposed by 
Rabat [Sequences of proteins of immunological interest. 

25 U.S. Department of Health and Human Services , Public 

Health Service, National Institutes of Health (1983)] for 
the identification of the hypervariable loops of 
innnunoglobulins multiplied by the average of the 
individual probability that antibody binding is retained 

30 for each possible pair-wise amino acid. Probabilities 

for retention of antibody binding associated with a given 
amino acid change were the values experimentally 
determined by assessing the effects on antibody binding 
of all possible amino acid substitutions for 103 

35 characterized linear epitopes. Geysen et al., (1988) J. 

* tradanark 
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Mol. Rec. 1:32-41. Thi9 algorithm thus weights the 
variability index to give more significance to amino acid 
changes likely to have a significant effect on antibody 
binding, i.e., compensates for conservative amino acid 
5 changes. Fifteen HCV sequences [HCV-1, Q3.2. HCT 23, 

EC10, HC-J1, HCVE1, TH, HCT 27, Q1.2, HCT18, HC-J4, HCV 
J1.2/HCV Jl.l, HCV J , HCV BK] , were used to determine 
the antigenicity profile for HCV. The HIV-1 V3 profile 
was obtained by averaging 242 individual profiles of 15 
10 sequences selected at random from the numerically greater 
data base of unique HXV-l sequences. LaRosa et al., 
(1990) Science 241:932-935 & Correction in Science (1991) 
p. 811. The amino acid sequences of some of these 
isolates between aa 384 and 420 are shown in Figure 3. 

15 

Computer Generated Secondary structure 
Predict iona 

The or* helix, 0- sheet and 0- turn secondary 
structure probabilities for the amino- terminal region 

20 (384-420) were determined using an algorithm, which 

assigns the probabilities for each of the three above 
secondary structural motifs to each residue. The 
coefficients used in the algorithm were obtained for all 
pair-wise combinations of residues of the structural data 

25 base. Levitt and Greer, (1977) J. Mol. Biol. 

114 : 181-293 - The prediction parameters obtained from 
these coefficients were fitted to the observed outcome 
when the algorithm was applied back on the database to 
obtain probabilities that a given residue would be found 

30 in one of the three defined secondary structural motifs. 
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Example 1 

comparison of Secondary Structure and Amino 
Acid Secruence Variation in the HCV E2/NS1 HV 
5 and HIV-1 crp!20 Domains 

The amino acid sequences from fifteen HCV and 
HIV-1 isolates were compared with respect to the number 
of positions at which amino acid sequence heterogeneities 
10 were observed in the HCV £2 HV or HIV- 1 gpl20 V3 domains 
(Figure 4, A and B, respectively). Amino. acid 
heterogeneities occurred in 25 of 30 amino acid positions 
in the £2 HV region and 23 of 35 amino acid positions in 
the HIV- i gpl20 V3 domain. Dashes on the x-axis of 
15 Figure 4 A and B represent amino acid positions where 
variable amino acid residues occur and invariant amino 
acids are given in the single letter amino acid code. 
The antigenicity profiles shown in Figure 4 indicate 
that, similar to the V3 loop of the HIV-i gpl2 0 protein 
20 (Figure 4B) , a block of amino acid residues in the HCV E2 
(amino acids 384-414 in Figure 4A) was identified whose 
variation haid a predicted adverse affect on antibody 
i binding. The data in figure 4 indicate that the HCV E2 
•domain resembles the HIV-1 gpl20 V3 domain, which is 
25 'known to encode virus neutralizing epitopes, in both the 
.degree and predicted significance of observed amino acid 
variation and suggests that the £2 HV domain may have a 
similar function as the gpl20 V3 domain. 

_ ... Linear epitopes" are more likely associated with 

30 less structured regions of proteins, in particular, the 
ends of proteins or with extended surface loops. A 
computer analysis was used to predict the probability 
that an individual residue is associated with a defined 
secondary structural motif for 15 E2 HV amino acid 
35 sequences between residues 384 to 420. Figure 4 shows 
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that the region between the E2 amino- terminal residue 3 84 
and th strongly predicted, highly conserved beta- turn 
(residues 415-418) is relatively unstructured as 
indicated by less than 50 percent probability of 
5 alpha- helix, beta- sheet or beta- turn character- ' Lack of 
strongly predictive structure in the E2 HV domain is 
consistent with the tolerance for extensive sequence 
variation found between isolates and is in contrast with 
highly structured regions which contribute to tertiary 

10 folding of the protein. The HCV E2 HV domain appears to 
be even less structured than the V3, principal 
neutralizing domain of HXV-i gpl2 0, which has been 
reported to contain a beta strand- type II beta turn-beta 
strand- alpha helix motif and may have greater structural 

15 constraints on amino acid variability than the HCV E2 HV 
domain. Taken together, the evidence suggests that the 
E2 HV domain appears to have features characteristic of 
protein domains which contain likely sites of linear 
neutralizing epitopes. 
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Example 2 

Epitope Mapping Qt the HCa E2/NS1 HV Domain 

Overlapping biotinylated 8-mer peptides 
corresponding to and extending past the E2/NS1 HV domain 
(amino acids 384 to 416) of HCT 18 (A,D) , Th (B, E) and 
HCV Jl (C,P) were bound to plates coated with 
streptavidin and reacted with plasma from either HCT 18 
(A-C) or Th (D-F) . The results are shown in Figure 6 for 
HCV isolates HCT 18 (Fig. 6A and 6D) , Th (Fig. 6B and 
6E) . and HCV Jl (Fig. 6C and SF) . HCT 18 plasma was 
diluted 1:200 and Th plasma was diluted 1:500. HVE-i, 
-2, -3, -4 and -5, represent isolate specific epitopes. 

As seen from Figure 6, HCT 18 plasma identi- 
fied a linear epitope (^PKQNV 411 ) when tested with 
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peptides derived from the HCT18 sequence (HVE-I in Figure 6A) , 
but failed to react with peptides corresponding to the HV 
domain of two different strains Th and HCV Jl ( Figures 6B 
and 6C) . In contrast, Th plasma identified linear 

5 epitope HVE-IV in the HV domain of Th ( 409 QNIQLI 414 , Figure 6B, 
panel E) , and also epitopes in strain HCT 18 ( 399 IVRFFAP 405 , 
Figure 6A,panel D) , and HCV Jl. Th, an IV drug user, may have 
been exposed to multiple strains of HCV. 

Both Th and HCT 18 plasma each reacted with an 

10 epitope (amino acids 413-419) coamon to all three 

isolates (data not shown) when used in an ELISA with pin 
synthesized overlapping Bmer peptides from each isolate. 

In-order to validate antibody binding 
specificity, antibodies bound to biotinylated peptides 

15 containing amino acids 403-407 were eluated and used to 
block the reactivity of HCT 18 plasma with pins 
containing overlapping 8-mers for the HCT 18 HV domain. 
These data indicate that 1) the E2/KS1 HV domain is 
immunogenic, 2) there are multiple epitopes which map to 

20 this region, and 3) a subset of epitopes (HVE-l, -2, -3, 
-4 or -5 in Figure 6) in the HV domain are isolate 
specific. 

Example 3 

25 Determination that Variant E2/FS1 gv Domains 

Can Be Associated mxh Flares o£ Hepatitis 

To investigate the possibility of finding HCV 
variants associated with the intermittent flares of 
hepatitis often found in chronic HCV infections, we 
partially sequenced the E2/NS1 gene from a patient, Q, 
with chronic hepatitis during two distinct episodes of 
hepatitis approximately two years apart (Ql and Q3, 
respectively) . The second episode of hepatitis occurred 
1.5 years after the termination of interferon treatment. 
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Th differences in the deduced amino acid 
sequence of the Ql and Q3 E2/NS1 HV region was strikingly 
different only between amino acids 391-408 with seven of 
eight changes occurring between amino acid 398 and 407 
5 (Figure 7). Figure 7 shows the deduced amino acid 

sequences of two regions of the E2/NS1 polypeptide, amino 
acids 384-414 and 547-647, for the Ql and Q3 isolates. 
The amino acid (E) above the Ql sequence was found in one 
of four Ql clones. The boxed amino acids represent the 
10 location of the Ql or Q3 HVE I2mer peptide. Amino acid 
sequence differences found between Ql and Q3 are printed 
in bold type. 

Only one amino acid heterogeneity was observed 
between amino acids 547 and 647 of the Ql and Q3 E2/NS1 
15 polypeptides (Figure 7) . 

To examine the effect of the amino acid 
substitutions observed in the Ql and Q3 E2 HV domains on 
antibody binding, ire synthesized a Ql and Q3 specific 
12-mer peptide from amino acids 396 to 407 (HVE Ql or Q3 
20 in Figure 7B) and separately reacted the Ql and Q3 plasma 
with each peptide in an ELISA. Table 4 shows that 
antibodies in both the Ql and Q3 plasma reacted with the 
Ql peptide but not with the Q3 peptide. Statistical 
analysis (Student's Test) indicated that the binding of 
25 the Q1/Q3 plasma to the Ql peptide was significantly 

above background binding of those plasma to a panel of 12 
randomly chosen control peptides (P<0.001), while binding 
of either the Ql or Q3 plasma to the Q3 peptide was not 
statistically significant. The data indicate that 
30 although patient Q developed antibodies to the HCV Ql HV 
domain, which were still detectable two years later at 
the Q3 time point, no detectable humoral response had 
developed to the Q3 E2 HV variant which was predominant 
during the second episode of hepatitis. 
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Table 4 

Elisa Results on 12-mer Peptides 

TARFAGFFQSGA TAGFVRLFETGP 
Plasma Ql seq Q3 seq 

Mean sd Mean sd 

Ql 1.158 0.134 0.691 0.123 

Q3 1.022 0.123 0.-93 0.036 



example 4 

Detection of Coexisting E2/NS1 Genes With 
Distinct E2/NS1 HV Domains in HCV Infected 
Individuals 

Figure 8 A shows the amino acid sequences 
deduced from two isolates of HCV Ji (Ji.i & Jl.2) which 
were cloned from one plasma sample of the Japanese 
volunteer blood donor HCV Jl. Kubo et al., (1989) Nucl. 
Acids Res. 12:10367-10372. Of the 23 total amino acid 
changes between HCV Jl.l and HCV Jl.2, 9 differences 
indicated by bold type are clustered in the 30 amino acid 
E2/NS1 HV domain. Five of the 9 amino acid substitutions 
in the E2/NS1 HV domain represent nonconservative amino 
acid changes. Since HCV Jl is the only group II HCV 
genome which has been cloned in our laboratory, it is 
unlikely that these differences are due to cross contami- 
nation of the HCV Jl plasma. The HCV Jl.2 sequence 
represents a minority sequence in HCV Jl's blood since 
only two E2/NS1 HV variant sequences were identified from 
7 cloned sequences which originated from two independent 
PGR reactions. 

Interestingly, a comparison of the HCT27 and 
HCV El isolates (Figure 8-2)* which were sequenced in 
different laboratories and derive from presumably 
unrelated individuals, showed that the number of amino 
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acid .differences in the E2/NS1 MV domain of these 
isolates were fewer than the number of differences 
observed between isolates from the same individual. 

The above described results lead to the 
5 suggestion that the HCV genome is rapidly evolving in 
individuals and the population. 

EXAMPLE 5 

Formulation and Preparation of Vaccine 
Coupling of the Diphtherip' Toxoid Carrier Protein to MCS 
10 Materials Required 

ethylene diamine-tetra-acetic acid (EDTA Na 2 .2H 2 0) (MW 
372) 

6-inaleimido-caproic acid N-hydroxysuccin jmide ester 
15 (MCS) (Sigma) -95%pure 

sodium dihydrogen orthophosphate (NaH 2 P0 4 ) 
nitrogen 

dimethyl fonmamide (DMF) 
Milli Q water 

20 0.1 M phosphate buffer containing 5 EDTA, pH 6.66 
0.1 M phosphate buffer, pH 8.0 
0.1 M phosphate buffer, pH 7 . 0 
sodium succinate [ (CH 2 COONa) 2 • 6H 2 0] 
cysteine 

25 hydrochloric acid (21 solution) 

0.1 M sodium succinate/0.1 EDTA, pH 5 . 6 

Purified diphtheria toxoid (Comonwealth Serum 
Laboratories, Victoria, Australia) was coupled to MCS 

30 according to the method described by Lee et al., (1980) 
Mol. Ixmnunol . 17:749; Partis et al., (1983) Prot. Chem 
2:263; Peeters et al., (1989) J. Immuunol. Methods 
120:133; Jones et al . , (1989) J. Immunol. Methods 
123:211. 100 ml of diphtheria toxoid was passed through 

35 a G25 Sephadex™ column (17cm X 4 cm) to remove thiomers 
The toxoid was eluted with 0.1 M phosphate buffer pH 7.0 
and the protein content of the eluate was assayed using 
the ECA protein determination (Pierce) . The resulting 
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solution was concentrated using an Amicon ultrafiltration 
unit to a final concentration of 10 mg/ml. 

One milliliter of the toxoid solution was 
dialyzed with 0.1 M phosphate buffer, pH 8.0, and then 
5 mixed with a solution of 1.5 tng MCS in 200 jxl DMF. The 
resulting solution was incubated at room temperature for 
1 hour ih the dark with occasional mixing. In order to 
separate the uncoupled MCS from the MCS -toxoid, the 
solution was passed through a Sephadex PD10 column which 

10 had been equilibrated with 0.1 M phosphate buffer, pH 
6.66 and the protein fraction was collected. 

The number of maleimido groups coupled per 
carrier molecule was determined prior to coupling of the 
HCV peptides thereto. Thirty milliliters of the 

15 succinate/EDTA buffer was sparged with nitrogen for 2 
minutes. Five milligrams of cysteine was transferred 
into a 25 ml volumetric flask and dissolved in a final 
volume of 25 ml of the sparged buffer. Aliquot s of the 
solutions shown in Table 5 were transferred in duplicate 

20 to 25 ml screw capped bottles. Using separate pipettes, 
nitrogen was bubbled into each aliquot. Each bottle was 
then sealed and incubated at room temperature in the dark 
for 40 minutes with occasional swirling. 

Table 5 

25 solution Sample (ml) Standard (ml) slants (ml) 

activated carrier 0.3 

phosphate buffer - 0.3 0.3 

cysteine solution 1.0 1.0 

succinate buffer - - 1.0 

* A 0.1 ml aliquot of each of the 3 solution was taken 
30 for an Ellman'a determination. 
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Ellman's Test for the Quantitative US LS^inSLLlsm fff 

Sulfhvdryl 

Materials Required 

Phosphate buffer, pH 8.0 
5 Dissolve 15,6 g NaH 2 P0 4 or 12.0 g 

NaH,P0 4 anhydrous in approximately 700 
ml Milli Q water. Adjust the pH to 
• 8.0 using 50% NaOH. Add Milli Q water 
for a final volume of 1000 ml and 
then adjust the pH if necessary. 
Ellman's Reagent 

Dissolve 10.0 mg of 5, 5' -dithiobis-2 - 
10 nitrobenzoic acid (DTNB) in 2.5 ml of phosphate 

buffer, pH 8.0 

0.1 ml of Ellman's reagent was added to each of 
the 0.1 ml aliquots of the solutions prepared above, 
namely the sample, standard and bland solutions. Five 

15 milliliters of phosphate buffer, pH 8.0, was then added 
to each aliquot, mixed well and allowed to stand for 15 
minutes. The absorbance of each aliquot was measured in a 
1 cm path length cell at 412 am. 

The number of maleimido groups present on the 

20 carrier protein was determined according to the following 
method. A 0.01 /unol per ml solution of -SH produces an 
absorbance of 0.136 in a 1 cm light path at 412 nm. The 
absorbance of the Standard or Sample (A) is equal to the 
amount of cysteine reacted with the coupled maleimido 

25 groups on the activated carrier protein. Since 1 mol of 
available -SH reacts with 1 mol of maleimido, the 
concentration in /mols of the maleimido groups present in 
the aliquot tested is equal to A<0.01) /0.136 >unol/ml. 
The total volume of the solution was 5.2 ml. Therefore, 

3 0 the total number of /imols present was equal to 

A(0.01) (5.2) /0.136. The sample solution had a total 
volume of 1.3 ml, of which 0.3 ml consisted of the 
activated carrier protein. The amount of maleimido 
groups present in the sample solution was calculated as 

35 A(0.01) (5.2) (1.3)7(0.136) (0.1) (0.3) -A(16.57) jimol/ml. 
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The MCS-activated carier protein was stored at -20°C. 
Reduction of the HCV Peptides 

Prior to coupling of the HCV peptides to the 
MCS-activated carrier protein, the peptides were reduced 
5 to ensure that thiol groups present on the pept ides were 
in the fiilly reduced -SH form. 
Materials Required 
dithiothreitol (DTT) 

ammonium hydrogen carbonate (NH4HCO1) 
10 methanol 

SEP-PAKs™ (C18 cartridge, Waters), 1 cartridge for each 8 

mg of peptide 
0.1 M ammonium hydrogen carbonate buffer 

Dissolve 7.9 g NH4HCO3 in 1 Milli Q 
15 water 

Buffer A, 0.1% v/v trif luoroacetic acid (TFA) in Milli Q 

water 

Buffer B, 60% v/v acetonitrile, 0.% v/v TFA in Milli Q 
water 

20 15 mg of each of two HCV pept ides corresponding 

to amino acids 384-411 and 225-260, respectively, of the 
HCV polyprotein were added to 2 . 5 ml pf 0 . 1 M armonium 
hydrogen carbonate containing a 10 fold molar excess of 
DTT. The resulting solutions were mixed until the peptide 

25 had dissolved and were then atlowed to stand for 1 hour 

at room temperature. Two pairs of SEP-PAKs were connected 
in series and activated by passing approximately 20 ml of 
methanol and then 20 ml of Buffer. A through each pair 
Q f SEP-PAKs™. Each peptide/DTT sample was slowly passed 

30 through a pair of SEP-PAKs™. The DTT was eluted with 20 
ml of Buffer A. The reduced peptide was eluted with 7 ml 
of Buffer B into pre-weighed bottle and then freeze-dried 
overnight. The bottles were then weighed to determine the 
amount of recovered peptide. The reduced peptides were 

35 then immediately coupled to the MCS-activated carrier 
protein. 
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Coupling HCV Peptides to MCS- Activated Carei» r Pmrp^n 

Approximately 100 ml of 0 . l M phosphate buffer 
with 5 mM EDTA, pH €.€6 was d gassed under vacuum and 
then sparged with nitrogen for 10 minutes. Twenty 
milliliters of a 10 mg/ml solution of the MCS- activated 
carrier protein was carefully sparged with nitrogen to 
prevent excessive frothing. 5 mg of each of the reduced 
peptides were dissolved in approximately 0.2 ml of the 
degassed sparged phosphate /EDTA buffer, pH 6.66 and then 
mixed with the MCS- activated carrier protein solution. 
The resulting mixture was transferred • into a screw capped 
bottle which vras then filled with nitrogen and sealed. 
The solution was further degassed by holding the bottle 
in a Branson 2000 e sonication bath for 2 minutes. The 
bottle was covered with aluminum foil and incubated 
overnight at roam tenperature with slow mixing on a 
shaker table. 

The resultant conjugate was soluble »r%A the 
uncoupled peptide was removed by passing the mixture over 
a Sephadex PD 10* column which had been equilibrated with 
the phosphate/EDTA buffer, pH 6.66. The protein fraction 
was collected. The amount of peptide conjugated to the 
carrier protein was determined by amino acid analysis. 

An amino acid analysis of 150 nl aliguots of 
both the conjugate and the carrier protein was performed. 
The average ratio of the level of amino acids contributed 
solely by the carrier protein mis determined to calculate 
the amount of conjugated peptide produced- Levels of 
serine, threonine, tryptophan, methionine, tyrosine and 
cysteine were not determined as these amino acids are 
modified under the standard hydrolysis conditions. 
Typical results obtained in these calculations are 
presented in Table 6. 
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AMINO ACID CARRIER ONLY CONJUGATE 

D 212 193 

E 194 170 

G 153 108 

R 60 56 

A 150 304 

P 79 1C3 

For the conjugate, the values in bold type are 

the amino acids that were also present in the peptides. 

For conjugates containing alanine and proline, the factor 

(193+179+180+56) / (212 ) +194+153+60) - 0.8659 is multiplied 

by the amount of the amino acid level in order to 

normalize the result. 

Preparation of Vaccine Composition 

Injectable compositions consisting of HCV 
peptides conjugated to MCS-activated diphtheria toxoid 
carrier protein prepared as described supra and a 
submicron oil-in-water emulsion adjuvant as described in 
PCT International Publication No. W09014837, published 
December 13, 1990. In addition, injectable compositions 
containing an immunostimulant , lipophilic muramyl peptide 
(MTP-PE, CIBA-GEIGY, Basel, Switzerland) in addition to 
HCV conjugated peptides and adjuvant were prepared. The 
vaccine compositions were generally comprised of 50% 
protein and 50% adjuvant. 

Formula for Vaccine Composition with mtp-pe 

To prepare 10 ml of injectable vaccine composition: 

2.5 ml Squalene (Sigma Chemical Co., St. Louis , Mo.) 
0.25 ml Tveen 80* (Sigma Chemical Co.) 
0.25 ml SPAN 85* (Sigma Chemical Co.) 
1000 Mg MTP-PE 

1*000 M9 HCV peptide conjugated to MCS-activated 
diphtheria toxoid carrier protein 
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Formula for Vaccine Composition without: MTP-PF 

T prepare 10 ml of injectable vaccine composition: 

2.5 ml Squalene (Sigma Chemical Co., St. Louis, Mo.) 
0.25 ml Tween 80* (Sigma Chemical Co.) 
0.25 ml SPAN 85* (Sigma Chemical Co.) 
1000 jig HCV peptide conjugated to MCS-activated 
diphtheria toxoid carrier protein 

Example 6 
Method for Testing Vaccina 
Preparations for Toxicity 

Vaccine prepared according to the methodology 
of Example 5 was tested for toxicity in small animals. 
Fifty microgram per kilogram of vaccine was administered 
to guinea pigs, mice and rabbits by intraperitoneal 
injection. The vaccine was also administered by 
intraperitoneal injection to rhesus monkeys and primates. 
Half of the test population of rhesus monkeys and 
primates received 5 pg/kg doses of the vaccine, vhile the 
other half received 50 Mg/kg dosages. Control animals 
employed in each of the studies were injected with a 
comparable amount of a composition consisting of the 
components of the vaccine preparation except the viral 
peptides. 

Each of the animals was monitored for symptoms 
indicative of a response to toxic material. More 
specifically, each animal in the study was examined bi- 
weekly for symptoms including fever, lethargy, weight 
loss, changes in eating habits and for lesions, swelling 
or tenderness at the site of injection. Lymph nodes 
proximal to the injection site were also examined for 
swelling and/or drainage. The animals were monitored on 
a bi-weekly basis for a period of several months. 



* trademark 



WO 93/06126 



-61- 



2116764 PCT/LS92/07683 



10 



15 



20 



30 



Example 7 

Demonstration or the Production gg 

Neutralizing Antibody in Vaccinated Animals 

Vaccine prepared according to the methodology 
of Example 5 was tested in chimpanzees in order to 
determine the effectiveness of the vaccine in eliciting 
the production of virus neutralizing antibody in 
vaccinated subjects. Chimpanzees were vaccinated with 5 
pg/kg dosages of vaccine prepared according to the 
methodology of Example 5 over a six-month time period at 
intervals of 0, 1, 3 and 6 months. Control chimpanzees 
were injected with comparable amounts of a composition 
consisting of the components of the vaccine except the 
viral peptides. Two weeks after the last dose of vaccine 
was administered, the test and control chimpanzees were 
each challenged with a 10 CIUjq (Chimpanzee Infectious 
Unit) dose of CDC/910 plasma inoculum. Commencing one 
week following the viral challenge, each of the 
chimpanzees was monitored for viremia on a weekly basis. 

In order to detect viremia, blood samples and 
liver biopsy specimens were collected from control and 
test animals on a weekly basis for several months. 
Tissue collected by liver biopsy was examined 
2 5 histologically for signs of necrosis and/or inflammation. 
In addition, hepatocytes from the biopsy material were 
examined by electron microscopy for the presence of 
tubules characteristic of HCV infection. The blood 
samples were also analyzed by the ELISA assay described 
supra for the presence of antibodies to segments of viral 
polypeptides which were not utilized in preparing the 
vaccina. In particular, each of the blood samples was 
screened by ELISA for the presence of antibodies to NS 3 , 
NS 4 and NS S peptides. The presence of antibodies to 

35 
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th s peptid s in the ••rum of a chimpanzee was 
indicative f HCV infect i n. 

Th following m thod was employed to detect 
viral RNA circulating in plasma or present in liver 
5 biopsy tissue collected from the chimpanzees* 

cPCft Method to Detect HCV RNA in Liver and in Serum 

In the cPCR assay , putative viral RNA in the 
sample is reverse transcribed into cDNA with reverse 

10 transcriptase; a segment of the resulting cDNA is then 
amplified utilizing a modified version of the PGR 
technique described by Saiki et al. (1986). The primers 
for the cPCR technique are derived from HCV RNA, which 
can be identified by the family of HCV cDNAs provided 

15 herein* Amplified product corresponding to the HCV-RNA 
is detected utilizing a probe derived from the family of 
HCV cDNAs provided herein. 

The cPCR/HCV assay used in these studies was 
performed utilizing the following methods for the 

20 preparation of RNA, the reverse transcription of the RNA 
into cDNA, the amplification of specific segments of the 
cDNA by PCR, and the analysis of the PCR products. 

RNA was extracted from liver utilizing the 
guanidium isothiocyanate method for preparing total RNA 

25 described in Maniatis et al* (1982). 

In order to isolate total RNA from plasma, the 
plasma was diluted five- to ten-fold with TENB (0.1 M 
NaCl, 50 mM Tris-HCl, pH 8.0, 1 IN EDTA) and incubated in 
a Proteinase K/SDS solution (0.5% SDS, 1 mg/ml Proteinase 

30 K, 20 micrograms/ml Poly A carrier) for 60 to 90 minutes 
at 37°C. The samples were extracted once with phenol (pH 
6.5), thm resulting organic phase was re-extracted once 
with TENB containing 0.1% SDS, and thai aqueous phases of 
both extractions were pooled and extracted twice with an 

35 equal volume of phenol/CHCl^/isoamyl alcohol [1:1(99:1)]. 
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Th resulting aqueous phas s w re extracted with an equal 
v lume of ChCl 3 /is amyl alcoh 1 (99:1) twice, and ethanol 
pr cipitated using 0.2 M sodium ac tate, pH 6.5, and 2.5 
volumes of 100% ethanol; precipitation was overnight at 
-20°C. 

The cDNX used as a template for the PCR re- 
action was prepared utilizing the designated samples for 
preparation of the corresponding cDHAs. Each RNA sample 
(containing either 2 micrograms of heat denatured total 
chimpanzee liver RNA or RNA from, 2 microliters of plasma) 
was incubated in a 25 microliter reaction containing 1 
micromolar of each primer, 1 millimolar of each 
deoxyribonucleotide triphosphate (dNTP) , 50 millimolar 
Tris-HCL, pH 8.3, 5 millimolar KgCl 2 , 5 millimolar 
dithiothreitol (DTT), 73 millimolar KC1, 40 units of 
RNase inhibitor (RNASIH) , and 5 units of AMV reverse 
transcriptase. The incubation was for 60 minutes at 
37°C. Following cDNA synthesis, the reactions were 
diluted with 50 microliters of deionized water (DIW) , 
boiled for 10 minutes, and cooled on ice. 

Amplification of a segment of the HCV cDNA was 
performed utilizing two synthetic oligomer 16-mer primers 
whose sequences were derived from HCV cDNA clones 3 6 
(anti-sense) and 37b (sense). The sequence of the primer 

from clone 36 was: 

5' CCA TCT CAT GAT CTA T 3'. 

The sequence of the primer from clone 37b was: 

5' ACA ATA CGT GTG TCA C 3'. 
The primers were used at a final concentration of 1 
micromolar each. In order to amplify the segment of HCV 
cDNA which is flanked by the primers, the cDHA samples 
were incubated with 0.1 microgram of RHAse A and the PCR 
reactants of the PerJcin Elmer Cetu. PCR kit* (H801-0043 or 
NB01-0055) according to the manufacturer's instructxons . 
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Th PCR reaction was p rformed for ither 30 cycles or 60 
cycles in a P rkin Elmer C tus DNA thermal cycler. Each 
cycle c nsist d of a 1 minute denaturation step ot 94°C, 
an annealing step of 2 minutes at 37°C, and an extension 
5 step of 3 minutes at 72°C. However, the extension step 
in the final cycle (30 or 60) was 7 minutes rather than 3 
minutes.) After amplification the samples were extracted 
with an equal volume of phenol: chloroform (1:1), 
followed by extraction with an equal volume of 

10 chloroform, and then the samples were precipitated with 
•thanol containing 0.2 M sodium acetate. 

The cPCR products were analyzed as follows. 
The products were subjected to electrophoresis on 1.8% 
alkaline agarose gels according to MuraJcawa et al. 

15 (1988), and transferred onto ZETAe Probe paper (BioRad 
corp.) by blotting gels overnight in 0.4 M NaOH. The 
blots were neutralized in 2 X SSC (1 X .SSC contains 0.15 
If NaCl, 0.015 M sodium citrate), prebybridized in 0.3 M 
NaCl, 15 mM sodium phosphate buffer, pH 6.8, 15 mM EDTA, 

20 10% SDS, 0.5% nonfat milk (Carnation Co.), and 0.5 mg/ml 
sonicated denatured salmon sperm DNA. The blots to be 
analyzed for HCV cDHA fragments were hybridized to a 
32 p-labeled probe generated by nick translation of the 
HCV cDNA insert sequence in clone 35, described in 

25 U.S. S.N. 07/456,637. After hybridization, the blots were 
washed in 0.1 X SSC (1 X SSC contains 0.1 5M NaCl, 0.0 1M 
Na citrate) at 65°C, dried, and autoradiographed. The 
expected product size is 586 nucleotides in length; 
products which hybridized with the probe and migrated in 

30 the gels in this size range were scored as positive for 
viral RNA. 

As a control, cPCR primers designed to amplify 
alpha-1 anti-trypsin mRNA was performed to verify the 
presence of RNA in each sample analyzed. The coding 
35 region of the alpha-1 anti-trypsin gene is described in 
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Rosenberg t al. (1984) . Synthetic oligomer 16-mer prim- 

rs designed to amplify a 365 nucle tide fragment of the 
coding region f the alpha*! antitrypsin g ne were 
derived from nucleotides 22-37 (sense) and nucleotides 
5 372-387 (antisense) . The PCR products were detected 

using a 32 P nick-translated probe which lies between, and 
not including, the cDNA/PCR primer sequences* 

Due to the extreme sensitivity of the PCR re- 
action, all samples were run a minimum of three times. 

10 All false positive signals were eliminated when the fol- 
lowing precautions were taken: 1) eliminating aerosols by 
using screw capped tubes with rubber O-ring seals; 2) 
pipetting with Renin MXCROKANe positive displacement 
pipetters with disposable pistons/capillaries; and 3) 

15 selecting the oligonucleotide sequences for the cDKA and 
PCR primers from two non-contiguous cDKA clones. 

Industrial utility . 

The immunoreactive compositions of the 
20 invention, have utility in the preparation of materials, 
for example, vaccines, which in turn may be used for the 
treatment of individuals against HCV infections, 
particularly chronic HCV infections. In addition, the 
compositions may be used to prepare materials for the 
25 detection of multiple variants of HCV in biological 

samples. For example, the immunoreactive compositions of 
the present invention can be used to generate polyclonal 
antibody compositions that recognize more than one HCV 
isolate, or as the antigen in an anti-HCV antibody 
30 immunoassay. The latter method can be used to screen 
blood products for possible HCV contamination. 
Polyclonal antiserum or antibodies can be used to for 
passive immunization of an individual. 
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CLAIMS 

1. An immunogenic polypeptide composition comprising at 
least two HCV amino acid sequences, each HCV sequence 
comprising at least one epitope within a variable domain 
5 of an HCV envelope protein, wherein the variable domain 
regions of the amino acid sequences are heterogeneous 
with each other and are derived from distinct HCV 
isolates . 



10 2. An immunogenic composition according to claim 1 

comprising a plurality of antigen sets, wherein (a) each 
antigen set consists of a plurality of substantially 
identical sequences comprising at least one epitope 
within a variable domain of an HCV polypeptide, and (b) 

15 the amino acid sequence of the epitope of one set is 

heterogeneous with respect to the amino acid sequence of 
at least one other set. 

3. An immunogenic composition according to claim 1 or 2 
20 wherein the distinct HCV isolates include an HCV group I 

isolate and an HCV group II isolate. 

4. An immunogenic composition according to any one of 
claims 1 to 3 wherein the variable domain is within the 

25 E2/NS1 protein. 

5. An immunogenic composition according to claim 4 
wherein the variable domain is encoded from about amino 
acid 384 to about amino acid 414 of the HCV polyprotein 

30 of Figure 9. 

6. An immunogenic composition according to any one of 
claims 1 to 3 wherein the variable domain is within the 
El protein. 
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7. An immunogenic composition according to claim 6 
wherein the variable domain is encoded from about amino 
acid 215 to about acid 255 of the HCV polyprotein of 
Figure 9. 

5 

8. An immunogenic composition according to any one of 
claims 1 to 3 wherein each amino acid sequence further 
comprises an epitope within a second variable domain of 
an HCV polypeptide, wherein the second variable domain 

10 regions of the amino acid sequences are heterogeneous 
with each other and are derived from distinct HCV 
isolates . 

9. An immunogenic composition according to claim 8 
15 wherein the first variable domain is within the E2/NS1 

protein and second variable domain is within the El 
protein- 
ic A method for preparing an immunogenic composition 
20 for treatment of HCV comprising mixing an immunogenic 
composition with a suitable excipient characterized in 
that said immunogenic composition is as defined in any. 
one of claims 1 to 9. 

25 11. A method for producing anti-HCV antibodies 
.comprising (i) using an effective amount of an 
immunogenic composition according to any one of claims 1 
to 9 in a mammal; and (ii) optionally purifying said 
antibodies from the blood of said mammal. 

30 

12. A polyclonal antibody composition made according to 
the method of claim 11. 

13. A method of detecting antibodies to HCV within a 
35 biological sample comprising: 
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(a) providing a biological sample suspected of 
containing antibodies to multiple strains of HCV; 

(b) providing a polypeptide reagent; 

(c) reacting the biological sample of (a) with the 
5 polypeptide reagent of (b) under conditions which allow 

the formation of antigen-antibody complexes; and 

(d) detecting the formation of complexes formed 
between the antigen of (a) and the antibodies of the 
biological sample of (b) , if any; 

10 characterized in that as a polypeptide reagent it 

contains a immunoreactive composition according to any 
one of claims 1 to 9. 

14. A kit for detecting antibodies to multiple strains 
15 of HCV within a biological sample comprising a diagnostic 
reagent and suitable packaging characterized in that as a 
diagnostic reagent it contains an immunoreactive 
composition according to any one of claims 1 to 9. 

20 15. A DNA molecule encoding a polypeptide comprising two 
antigenic heterogeneous amino acid sequences from the 
same variable domain of distinct HCV isolates, wherein 
the variable domain is selected from the group consisting 
of the El or E2/NS1 domains. 

25 

16. A host cell comprising a DNA molecule according to 
claim 15. 

17. A host cell according to claim 16 wherein the DNA 
30 molecule comprises control sequences that are capable of 

causing the expression of the polypeptide. 

18. A method of making a recombinant protein comprising 
growing a population of host cells according to claim 17 

35 under conditions that provide for the expression of the 
polypeptide . 
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